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Abstract. We formulate and prove a variational principle (in the sense of 
thermodynamics) for random domino tilings, or equivalently for the dimer 
model on a square grid. This principle states that a typical tiling of an arbitrary 
finite region can be described by a function that maximizes an entropy integral. 
We associate an entropy to every sort of local behavior domino tilings can 
exhibit, and prove that almost all tilings lie within e (for an appropriate metric) 
of the unique entropy-maximizing solution. This gives a solution to the dimer 
problem with fully general boundary conditions, thereby resolving an issue 
first raised by Kasteleyn. Our methods also apply to dimer models on other 
grids and their associated tiling models, such as tilings of the plane by three 
orientations of unit lozenges. 



The effect of boundary conditions is, however, not entirely trivial and 
will be discussed in more detail in a subsequent paper. 

P. W. Kasteleyn, 1961 



1. Introduction 



1.1. Description of results. A domino is a 1 x 2 (or 2x1) rectangle, and a tiling 
of a region by dominos is a way of covering that region with dominos so that there 
are no gaps or overlaps. In 1961, Kasteleyn [Kal| found a formula for the number 
of domino tilings of an m x n rectangle (with mn even), as shown in Figure |l| for 
m = n = 68. Temper ley and Fisher [TF] used a different method and arrived at the 
same result at almost exactly the same time. Both lines of calculation showed that 
the logarithm of number of tilings, divided by the number of dominos in a tiling 
(that is, mn/2), converges to 2G/n k, 0.58 (here G is Catalan's constant). On the 
other hand, in 1992 Elkies et al. [ EKLP[ | studied domino tilings of regions they 
called Aztec diamonds (Figure |2| shows an Aztec diamond of order 48) , and showed 
that the logarithm of the number of tilings, divided by the number of dominos, 
converges to the smaller number (log2)/2 « 0.35. Thus, even though the region in 
Figure ^ has slightly smaller area than the region in Figure ||, the former has far 
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Figure 1. A random domino tiling of a square. 



more domino tilings. For regions with other shapes, neither of these asymptotic 
formulas may apply. 

In the present paper we consider simply-connected regions of arbitrary shape. 
We give an exact formula for the limiting value of the logarithm of the number 
of tilings per unit area, as a function of the shape of the boundary of the region, 
as the size of the region goes to infinity. In particular, we show that computation 
of this limit is intimately linked with an understanding of long-range variations 
in the local statistics of random domino tilings. Such variations can be seen by 
comparing Figures |l| and |2| Each of the two tilings is random in the sense that the 
algorithm |PW| that was used to create it generates each of the possible tilings of 
the region being tiled with the same probability. Hence one can expect each tiling 
to be qualitatively typical of the overwhelming majority of tilings of the region 
in question, as is in fact the case. Figure |l| looks more or less homogeneous, but 
even cursory examination of Figure ^ shows that the tiling manifests different gross 
behavior in different parts of the region. In particular, the tiling degenerates into a 
non-random-looking brickwork pattern near the four corners of the region, whereas 
closer to the middle one sees a mixture of horizontal and vertical dominos of the 
sort seen everywhe re in Figur e |l| (except very close to the boundary). 

In earlier work | CEP , |JP^ two of us, together with other researchers, analyzed 
random domino tilings of Aztec diamonds in great detail, and showed how some of 
the properties of Figure || could be explained and quantified. It was proved that 
the boundary between the four brickwork regions and the central mixed region for 
a randomly tiled Aztec diamond tends in probability towards the inscribed circle 
(the so-called "arctic circle" ) as the size of the diamonds becomes large | JPS| , and 
that even inside the inscribed circle, the first-order local statistics (that is, the 
probabilities of finding individual dominos in particular locations) fail to exhibit 
homogeneity on a macroscopic scale [CEP]. 
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Figure 2. A random domino tiling of an Aztec diamond. 



Unfortunately, the techniques of [JPS] and |CEP| do not apply to general re- 



gions. A few cases besides Aztec diamonds have been analyzed; for example, ran- 
dom domino tilings of square regions have been analyzed and do turn out to be 



homogeneous [BP|. (That is, if one looks at two patches at distance at least d from 
the boundary of an rt x n square, with n even, the local statistics on the two patches 
become more and more alike as n goes to infinity, as long as d goes to infinity with 
n.) However, before our research was undertaken no general analysis was known. 

In this paper, we will demonstrate that the behavior of random tilings of large 
regions is determined by a v ariatio nal (or entropy maximization) principle, as was 



conjectured in Section 8 of | CEP |. We show that the logarithm of the number of 
tilings, divided by the number of dominos in a tiling of R, is asymptotic (when 
area(i?) ^ oo) to 

(1.1) sup//^^ent(|,|)dxd,. 

Here the domain of integration R* is a normalized version of R, the function h 
ranges over a certain compact set of Lipschitz functions from R* to M, and 

(1.2) ent(s, t) = -{L{-Kpa) + L{TTpb) + L{ttPc) + L{TTpd)), 

TT 
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where L(-) is the Lobachevsky function (see Q), defined by 
(1.3) L{z) = - [ log|2sint| dt, 



and the quantities PaiPb^PcPd are determined by the equations 

(1-4) 2{pa-pb) = t, 

(1.5) 2{pd-Pc) = s, 

(1.6) Pa+Pb+Pc+Pd = 1, 

(1.7) sin(7rpQ) sin(7rpb) = sin(7rpc) sin(7rpd). 

The quantities pa, Pb, Pc, Pd can be unde rsto od in terms of properties of random 
domino tihngs on the torus (see Subsection L2); the quantities also have attractively 
direct but still conjectural interpretations in terms o f the local statistics for random 
tilings of the original planar region (see Conjecture 13.2 ). 

There exists a unique function / that achieves the maximum in (^]^) . Its partial 
derivatives encode information about the local statistics exhibited by random tilings 
of the region; for example, the places {x,y) where the "tilt" {df /dx,df /dy) is 
(2,0), (—2,0), (0,2), or (0,-2) correspond to the places in the tiling where the 
probability of seeing brickwork patterns goes to 1 in the limit, in a suitable sense. 
The function / need not be C°°, and in fact often is not. For example, in the case of 
Aztec diamonds / is smooth everywhere except on the arctic circle, where it is only 
(except at the midpoints of the sides, where it is only C*'). Inside this circle, / 
takes on a variety of values, corresponding to the fact that different local statistics 
are manifested at different locations. In contrast, for square regions the function / 
is constant, corresponding to the fact that throughout the region (except very close 
to the boundary), the local statistics are constant. (See Figures and ||.) 

In general, the locations where / is not smooth correspond to the existence of 
a phase transition in the two-dimensional dimer model, closely related to a phase 
transition first noticed by Kasteleyn |Ka2]. 

The function / is related to combinatorial representations of states of the dimer 
model called height functions. Each of the different tilings of a fixed finite simply- 
connected region in the square grid has a height function which is a function from 
the vertices of the region to Z satisfying certain conditions (see Subsection 1.4); 
there is a one-to-one correspondence between tilings and height functions (as long 
as we fix the height at one point, since the actual correspondence is between tilings 
and height functions modulo additive constants). All of the height functions agree 
on the bound ary o f the region but have different values on the interior. In an 



earlier paper [ CEP , it was shown that these height functions satisfy a law of large 
numbers, in the sense that for each vertex v within a very large region, the height of 
z; in a random tiling of the region is a random variable whose standard deviation is 
negligible compared to the size of the region. This suggested the existence of a lim it 
law, but did not indicate what the limit law was. We show (see Theorem 1.1 at 



the end of Subsection 1.2): for every e > 0, the height function of a random tiling, 
when resettled by the dimensions of the region being tiled, is, with probability tending 
to 1, within e of /, where / is the function that maximizes the double integral in 
( [l.l| ). We therefore call / an asymptotic height function. One may qualitatively 
summarize our main results by saying that the pattern governing local statistical 
behavior of uniform random tilings of a region is, in the limit, the unique pattern 
that maximizes the integral of the "local entropy" over the region. Moreover, the 
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value of this maximum is an asymptotic expression for the "global entropy" of the 
ensemble of tilings of the region. 

Our main theorem (Theorem 1.1) thus has two intertwined components: a law of 
large numbers that describes the local statistics of random tilings of a large region 
by way of an asymptotic height function /, and an entropy estimate that tells us 
that the total number of tilings is determined by a certain functional of that self- 
same function /. Furthermore / is precisely the unique Lipschitz function that 
maximizes that functional. 

We supplement our main theorem by two supporting results: a large deviations 
estimate (Theore m |1.3[ ) and a PDE that the entropy-maximizing function / must 
satisfy (Theorem 12.1). To be technically correct, we should not assert that the 
function / satisfies the PDE everywhere, but only where the partial derivatives are 
continuous and the tilt {s,t) satisfies \s\ + \t\ < 2. This proviso is necessary because 
singularities in / (corresponding to domain boundaries like the arctic circle) are 
an essential feature of the phenomena we are studying; in fact, we shall find (see 
Section ^ and Theorem ) that these singularities are related to a phase transition 
manifested by the dimer model as an external field is permitted to vary, and that 
domain boundaries can be viewed as a spatial expression of that phase transition. 

Our results also imply that if instead of studying the uniform distribution on the 
set of all the tilings of a large region R, one restricts one's attention to those tilings 
whose height functions (suitably normalized) approximate some asymptotic height 
function h (which need not be the h that maximizes the double integral (1.1)), 
then the entropy of this restricted ensemble (that is, the normalized logarithm of 
the number of tilings) approximates the value of the double integral. Note that 
this more general result implies that the total number of (unrestricted) tilings is at 
least as large as the supremum (|]^). As for integrand itself, we remark here that 
ent(s,t) achieves its maximum at (0,0) and that it goes to zero as {s,t) goes to the 
boundary curve |s| + \t\ = 2; that is, there are many ways to tile a patch so that 
its average tilt is near zero, but fewer ways to tile it as the tilt gets larger, and no 
ways to tile it at all if the desired tilt (s, t) fails to satisfy \s\ + \t\ < 2. 



1.2. Interpretation. A dimer configuration, or perfect matching, of a graph is a 
set of edges in the graph such that each vertex belongs to exactly one of the selected 
edges. To see the equivalence between tilings by dominos and dimer configurations 
on a grid (the graph-theoretic dual to the graph of edges between lattice squares) , 
one need only replace each 1x1 square by a vertex and each domino by an edge. 

To explain the significance of the quantities Pa,Pb,Pc,Pd, we need to digress 
and discuss the dimer model on a torus. Here the graph is just like the m x n 
rectangular grid, but with m extra bonds that connect vertices on the left and 
right and n extra bonds that connect vertices on the top and bottom. Kasteleyn 
showed [Kal] that the number of dimer configurations on this graph is governed 
by the same asymptotic formula as for the dimer model on a rectangle. In the 
same article Kasteleyn considered a generalization to non-uniform distributions 
on the set of tilings, obtained by assigning different "weights" to horizontal and 
vertical bonds, and letting the probability of a particular dimer configuration be 
proportional to the product of the weights of its bonds. In the statistical mechanics 
literature, such modifications of a model are sometimes conceived of as resulting 
from the imposition of an external field. 
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Here we go one step further, and impose a field that discriminates among four 
different kinds of bonds: a-bonds and 6-bonds (both horizontal) and c-bonds and 
d-bonds (both vertical) , staggered as in Figure || from Section ^ (this is different 
from a 4-parameter external field that was considered by Kasteleyn |Ka2]). The 
field depends on four parameters a, 5, c, d, which are the weights associated with 
the respective kinds of bonds; the probability of a dimer configuration on the torus 
graph is proportional to the product of the weights of its bonds. Taking the graph- 
theoretic dual, we get a non-uniform distribution on domino tilings on the torus, 
having less symmetry than the torus itself. One way to motivate the consideration 
of such asymmetrical measures is to observe that in regions like the one shown in 
Figure ^, the boundary conditions break the symmetry of the underlying square 
grid in precisely this fashion, yielding subregions in which bonds (or dominos) of 
one of the four types predominate. 

To avoid unnecessary complexity, we limit ourselves to square tori (m — n) , but 
the situation for more general tori is much the same. 

The quantities Pa,Pb,Pc,Pd have a direct (and rigorously proved) interpretation 
in terms of the dimer model on the nx n torus. Suppose we are given positive real 
weights a, b, c, d as described above. Suppose that each of a, b, c, d is less than the 
sum of the others. Then there is a unique Euclidean quadrilateral of edge lengths 
a, c, 6, d (in that order) which is cyclic, that is, can be inscribed in a circle. Define 
Pa to be f/(27r) times the angle of arc of the circumscribed circle cut off by the 
edge a of this quadrilateral. Similarly define Pb,Pc and pd- Then we shall see (see 
Theorem |8.3| ) that pa is (in the large-n limit) the probability that a given a-bond 
belongs to a randomly-chosen dimer configuration (under the probability distribu- 
tion determined by the weights a,b,c,d), and likewise for pb,Pc,Pd- Technically, we 
only prove convergence for n in a large subset of N, but we believe that convergence 
holds for all n. If on the other hand a > b + c + d, then we shall see that as n — oo, 
Pa tends to 1 (and pb,Pc,Pd tend to zero). A similar phenomenon occurs when b, 
c, or d is greater than or equal to the sum of the others. 

Moreover, the quantities s = 2{pd — Pc) and t = 2{pa — Pb) have a height- 
function interpretation. Since the torus can be viewed as a roUed-up plane, every 
dimer configuration on the torus "unrolls" to give a dimer configuration on the 
plane, which (in the guise of a domino tiling of the plane) gives rise to a height 
function. The height is not in general a periodic function, but rather increases by 
some amounts and Hy as one moves n vertices in the -I- a; direction or n vertices 
in the +y direction. Here and Hy depend on the tiling chosen, and thus are 
random variables; the expected values oi Hx/n and Hy/n are s and i, respectively. 

The relationship between the uniform measure on tilings of finite simply-connected 
regions and the a, 6, c, d- weighted measure on tilings of tori may become clearer after 
one has verified that the a, 5, c, d- weighted measure on the tilings of a finite simply 
connected region, defined in the obvious fashion, is nothing other than the uniform 
distribution, as long as ab = cd. To see this, one may make use of the fact that any 
domino tiling of such a region can be obtained from any other tiling by a sequence 
of moves, each of which consists of applying a 90-degree rotation to a pair of domi- 
nos that form a 2 x 2 block 0. Such a move trades in an a-domino and a 6-domino 
for a c-domino and a d-domino, or vice versa. The condition ab = cd then guar- 
antees that the two tilings have the same weight. Since such moves suffice to turn 
any tiling into any other, all the tilings have the same weight, and the probability 
distribution is uniform. This property is called "conditional uniformity," because 
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if one conditions on the tiling outside a simply-connected region, the conditional 
distribution on tilings of the interior is uniform. 

We do not propose any concrete interpretation for the weights a, b, c, d, and 
it might be unreasonable to ask for one, given that the probability distribution 
on matchings determined by a,b,c,d is unaffected if all four are multiplied by a 
constant. There is a choice of scaling that makes most of our formulas relatively 
simple, namely, the scaling that makes the arcsines of the four weights add up to 
TT, or equivalently, the scaling that makes the product 

{a + b + c - d){a + b - c + d)(a - b + c + d)(-a + b + c + d) 

equal to the product 

4(a6 + cd){ac + bd){ad + be) 
(this is most easily seen from Theorem p.3[ ). However, it is worth mentioning at 
least one instance in which the scaling sin" a + sin"^ b + sin"^ c + sin"^ d — tt does 
not give the simplest possible formula for a, 6, c, d. Specifically, consider the nor- 
malized Aztec diamond with vertices at (±1, 0) and (0, ±1). The arctangent law of 



CEP I gives one way of expressing how Pa,Pb,Pc,Pd vary throughout the normalized 
diamond (corresponding to the fact that the respective densities of north-, south-, 
east-, and west-going dominos vary throughout large Aztec diamonds). However, 
an even more compact way of stating this dependence is via the formulas 



a — 




+ T/)2 — 


b = 


v/(i 


— nY — 


c — 




-\- xY — 


d ^ 


v/(l 


— x)^ — y2 



1.3. Sketch of proof and preliminary statement of results. The strategy 
behind our proof of the main theorem is roughly as follows. In the first few (and 
more qualitative) sections of the article (Sections ^ through]^), we cover the set of 
all domino tilings of the region by subsets which are balls in the uniform metric on 
height functions (that is, each subset consists of tilings whose height functions are 
approximately equal); each ball is associated with an asymptotic height function 
h from a bounded subset of j^to M. Appealing to quantitative results proved 
later in more technical sections of the article (Sections ^ through ^l]), as well as 
to combinatorial arguments, we show that the l ogar ithm of the cardinality of a 



ball is approximated by the double integral in (O) times half the area of the 
region. However, we also show that there is a unique ball in our cover for which 
the corresponding h maximizes the double integral, and that the contribution that 
this ball makes to the total number of tilings swamps all the other contributions. 

In Section 0, we compute the "partition function" for tilings (that is, the sum 
of the weights of all the tilings). This computation relies on Kasteleyn's original 



work [Kal]. The next few sections of the article, on which the first few sections 
depend, are close in spirit to Kasteleyn's original work on the dimer model on the 
torus. (Indeed, the local entropy ent(s,t) is equal to the asymptotic entropy for 
dimer covers of the n x n torus as n goes to infinity, where the edges have been 
assigned new weights as described above, favoring those tilings that have tilt near 
{s,t).) Using Kasteleyn's method of Pfafhans, we show that the dimer model in 
the presence of weights a, 5, c, d exhibits a phase transition when any of the four 
weights equals the sum of the other three, and our method of analysis also requires 
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that special attention be given to the case in which a = b and c — d (which was 
analyzed by Kasteleyn). The calculations are lengthy, but the end results, obtained 
in the final sections of the paper, are satisfyingly simple. For direct formulas for 
Pa,Pb,Pc,Pd in terms of s = df/dx and t = df/dy, see (L8) below. 

Here is a loose statement of our result. See Theorems |l.3| and 4.3 for more 
precisely quantified statements, and the remainder of Section ^ for the relevant 
definitions. 



Theorem 1.1. Let R* be a region in bounded by a piecewise smooth, simple 
closed curve dR* . Let /if, : dR* — s- R 6e a function which can be extended to a 
function on R* with Lipschitz constant at most 2 in the sup norm. Let f : R* 
be the unique such Lipschitz function maximizing the entropy functional ^iit(f) (see 
equations (l.S) and (1.9)), subject to f\dR* = hf,. (See Section^ for the proof of 
the asserted uniqueness.) 

Let R be a lattice region that approximates R* when rescaled by a factor of 
l/n, and whose normalized boundary height function approximates hb. Then the 
normalized height function of a random tiling of R approximates f , with probability 
tending to 1 as n oo. 

Furthermore, let g : R* ^ W be any function satisfying the Lipschitz condition. 
Then the logarithm of the number of tilings of R whose normalized height functions 
are near g is n?{Fiiit(g) + o(l)). 

In his thesis |^ , HofFe derives (in a less rigorous manner) expressions equivalent 
to our ( |7.20| ) and (|8j). In ^^^], Destainville, Mosseri, and Bailly set up a similar 
framework to our variational principle, but they use a quadratic approximation to 
the entropy, and they do not supply rigorous proofs. Readers of this article might 
also wish to read |Ke2] and [P2|. These two articles cover some of the same ground 
as this one, but with more of an emphasis on phenomena and less concern with 
proofs. 

Before we can continue our discussion of the variational principle and state the 



main result more precisely (Subsection 1.6), we need to review some background 
on entropy and height functions. 



1.4. Height functions. Height functions are a geometrical tool discovered in indi- 
vidual cases by Blote and Hilhorst | BH ] and Levitov [Q and independently studied 
by Thurston @], who situated the idea in a less ad hoc, more general context. 
Given a (connected and) simply-connected region R that can be tiled by dominos, 
domino tilings of R are in one-to-one correspondence with height functions on the 
set of lattice points in R, defined up to an additive constant. The height function 
representation is useful because the difference between the heights of two vertices 
encodes non-local information about a tiling. Here we will quickly summarize the 
basic definitions and properties of height functions. For a more extensive discussion 
aimed at applications to random tihngs, see Subsections 6.1-6.3 and Section 8 of 
|CEP |; for a geometrical point of view, see 

Color the lattice squares in the plane alternately black and white, like a checker- 
board. (It does not matter which of the two ways of doing this is used.) Define a 
horizontal domino to be north-going or south-going according to whether its left- 
most square is white or black, and define a vertical domino to be west-going or 
east-going according to whether its upper square is white or black. These names 
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come from the domino shuffling algorithm from | EKLP | , which is the main combi- 
natorial tool used to study Aztec diamonds. We will not use domino shuffling, but 
the division of dominos into these four orientations will nevertheless be important, 
as will the coloring in general. (It is intended that north-going dominos correspond 
to a-edges, south-going to 6-edges, east-going to c-edges, and west-going to d-edges. 
The geometrical terminology is more pleasant when we have no weights in mind.) 

Let i? be a lattice region (i.e., a connected region composed of squares from 
the unit square lattice), and let V be the set of lattice points within R or on 
its boundary. We will always assume that R can be tiled by dominos in one or 
more ways. A height function h on R is a function from V to Z that satisfies the 
following two properties for adjacent lattice points u and v on which it is defined. 
(We consider two lattice points to be adjacent only if the edge connecting them is 
contained in R.) First, if the edge from u to v has a black square on its left, then 
h{v) equals h{u) -I- 1 or h{u) — 3. Second, if the edge from m to u is part of the 
boundary of R, then \h{u) — h{v)\ = 1. 

Given a height function h on R, consider the set of all pairs of adjacent lattice 
points u,v with \h{u) — h{v)\ = 1; then it is not hard to check that the set of all 
dominos in R that are bisected by such an edge taken together constitute a tiling 
of R. 

Given a domino tiling of a simply-connected region R, we can reverse the process, 
and find a height function that leads to the tiling. Such a height function always 
exists, but it is not quite unique, because one can add a constant (integer) value 
to it everywhere to get another such height function. To avoid this ambiguity, we 
fix the height of one lattice point on the boundary of R. Then height functions 
satisfying this constraint are in one-to-one correspondence with domino tilings. 

From this point on, when we use the term lattice region, we will always implicitly 
assume that one of the lattice points on the boundary has a specified height, and 
when we talk about height functions we will always assume that they satisfy this 
condition. Notice that when the region is simply-connected, our assumption takes 
on an especially pleasant form, because it is then equivalent to fixing the heights 
along the entire boundary. (However, when the boundary is in several pieces, the 
situation is more complicated. In this paper, we will typically assume that our 
regions are simply-connected, although we will always mention that assumption 
when we make it.) 

To define a height function for a domino tiling of an n x n torus, view the torus 
as an n X n square with opposite sides identified, and view this square as sitting 
centered inside an (n+2) x (n+2) square. A tiling of the torus determines a covering 
of the smaller square by dominos that lie inside the larger square. (Dominos that 
jump from one side of the small square to the other correspond to two dominos in 
the large square; the rest correspond to just one.) Then define the height function 
on the n X n square as for a lattice region in the plane. This height function may 
not be well defined on the torus, since its values can be different at two identified 
vertices, but this will not matter f or our purposes. This definition is not really 
natural, but it is convenient. (See |STCR| for a more natural definition of height 
functions on tori. We will not use their definitions or results.) 

Let i? be a lattice region, and let H{R) be the set of all height functions on R. 
We define a partial ordering < on H{R) by setting hi < /12 if hi{u) < h2{u) for 
every lattice point u G R. 
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The set H{R) is not just a partially ordered set, but also a lattice. The join of 
two height functions hi and can be defined by [hi V h2){u) = max(/ii(u), /i2(u)), 
and their meet by {hi A /i2)(u) — min(/ii(tt), ft,2(u)). To show that H{R) is a 
lattice, we need to show that hi V ft.2 and hi A /12 are height functions. Notice that, 
at each lattice point u, the value of height functions at u is determined modulo 4 
(independently of the specific tiling) ; this assertion is trivial when u is the point at 
which we have fixed the values of height functions, and if it is true at some particular 
lattice point, then the definition of a height function immediately implies that it 
is true at all adjacent lattice points. Thus, if hi{u) is unequal to h2{u), then they 
differ by at least 4. It now follows from the definition of a height function that if 
hi{u) > /i2(u), then hi{v) > h2{v) for all lattice points v adjacent to u. Hence, 
given any two adjacent lattice points, hi V /i2 (or hi A /12) agrees with hi at both 
points, or agrees with /12 at both of them, whic h is what is needed to show that 
hi V /i2 and hi A /12 are height functions. (See [Pl| for details and more general 
results.) 

As Figures and |^ demonstrate, the precise boundary conditions of a region 
can have a dramatic effect on the behavior of typical tilings. Height functions 
provide the proper tool for gauging the effect of boundary conditions on random 
tihngs. (Proposition 20 of |CEP[| is one way to make this claim precise. It says, 
roughly, that the height function of a typical tiling depends continuously on the 
heights on the boundary of the region.) For example, one can compute the rate 
of change of the height function in terms of the probabilities of finding dominos in 
given locations. If a region has statistically homogeneous random tilings, then the 
typical height function should be nearly planar. Because the boundary heights for 
Aztec diamonds are highly non-planar, typical tilings cannot be homogeneous. 

Define the average height function of a lattice region R to be the average of 
all height functions on R. (Of course, it is almost never a height function itself.) 
Theorem 21 of | CEF[ implies (roughly) that if R is large, then almost all height 
functions on R approximate the average. Thus, the problem of describing typical 
tilings is reduced to the problem of describing the average height function. 



1.5. Entropy. The entropy of a random variable that takes on n different values, 
with probabilities pi, ■ ■ ■ ,Pn, is defined as ~Pi^ogpi (with OlogO = 0). For a 

uniformly distributed random variable, the entropy is simply the logarithm of the 
number of possible outcomes. We will nevertheless often need to deal with entropy 
for a non-uniform distribution. In general, we denote the entropy of a random 
variable X by ent{X); there should be no confusion with our use of "ent(-)" to 
denote local entropy depending on tilt. 

We will sometimes speak of the conditional entropy of a discrete random variable, 
conditional upon some event; this is just the entropy of the conditional distribution 
determined by the conditional probabilities. 

The only fact about entropy that we will need other than the definition is the 
following standard fact about conditional entropy (the proof is straightforward). 

Lemma 1.2. Suppose X is a random variable that takes on values xi, . . . ,Xn, and 
suppose that {xi, . . . , Xn} is partitioned into blocks _Bi, . . . , Bm- Let B be the ran- 
dom variable that tells which block X is in, and let Xi be the random variable 
that takes on values in Bi according to the conditional distribution of X given that 
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X (z Bi. Then the entropy of X is given by 

m 

ent{X) = ent(B) + ^ Prob(a; € B,)ent{X,). 
1=1 

When we deal with entropy for tihngs of a lattice region R, we will always 
normalize it by dividing by half the area of R, so that we measure the information 
content per domino. Thus, the normalized entropy of a set of tilings of a lattice 
region R (under the uniform distribution) is the logarithm of the number of tilings 
in the set, normalized by dividing by the number of dominos in a tiling of R. When 
we refer to the entropy of a region R, we mean the entropy of the set of all tilings 
of R, under the uniform distribution. 

1.6. The variational principle. Let R* be a region in bounded by a piecewise 
smooth, simple closed curve OR*. (We will always assume our curves do not have 
cusps.) Suppose i? is a large, simply-connected lattice region. We can normalize 
by scaling the coordinates by a factor of 1/n (for some appropriate choice of n). 
Suppose that the normalization of R approximates R* (in a sense to be clarified 
below). If we scale the values of height functions by dividing their values by n, then 
for large n the normalized height functions that one obtains approximate functions 
on R* that satisfy a Lipschitz condition with constant 2 for the sup norm. The 
reason for this is that if u and v are lattice points at sup norm distance at most d 
from each other within R (i.e., they are connected by a path within R of length d, 
where the allowable steps are lattice edges or diagonal steps) and h is any height 
function, then one can check that \h {u) — h{ v)\ < 2d~\- 1. (One can prove this claim 



directly, or deduce it from Lemmas 3.1 and |3.3| .) 

Whenever we refer to a 2-Lipschitz function f from a subset of to R, we mean 
one that is locally Lipschitz with Lipschitz constant 2 for the sup norm, i.e., its 
domain is covered by open balls in which every pair of points {xi,yi) and (x2,j/2) 
satisfy 

\f{xi,yi) - f{x2,y2)\ < 2max(|a;i - X2\, \yi - y2\)- 
Notice that where such a function / is differentiable, it must satisfy 

\df/dx\ + \df/dy\ < 2. 

Conversely, every differentiable function satisfying this condition is 2-Lipschitz. 
We call {df /dx, df /dy) the tilt of the function (and use this terminology whether 
or not / is Lipschitz). It is important to keep in mind Rademacher's theorem 
(Theorem 3.16 of |^]), which says that every Lipschitz function is differentiable 
almost everywhere. 

Suppose that dR* is a simple, closed, piecewise smooth curve in M? that bounds 
a region R* . We say that a function hb defined on dR* is a boundary asymptotic 
height function if there exists a 2-Lipschitz function h on R* such that — h^, 

and we call such an h an asymptotic height function. Let AH(i?*, hb) be the set of 
all asymptotic height functions on R* with boundary heights hb] notice that this 
set is convex and that it is compact with respect to the sup norm. 

Before continuing, we need to specify exactly what it means for one region 
bounded by a simple closed curve to approximate another. This can be defined 
by the Hausdorff metric on closed subsets of R^; specifically, two regions are de- 
fined to be within e of each other if the e-neighborhood of each one's boundary 
curve contains the other's. 
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We also need to discuss what approximation to within e means when there are 
boundary asymptotic height functions on the curves. Given regions Ri and R2 with 
boundary asymptotic height functions hi and /12, we say that (i?i, hi) is within e 
of (i?2, ^2) if for all xi £ dRi there exists a. X2 & 9i?2 such that d{xi, X2) < s and 
\hi{xi) — h2{x2)\ < £, and vice versa. 

For technical reasons, it is most convenient to write out our arguments in the 
case of approximation from within, where we have a sequence Ri , R2^ • • • of regions 
in the interior of R* whose limit is R* . In most cases, this is easily arranged. 
The regions i?2, . . . will be rescaled lattice regions, and if R* is a star-shaped 
region, then by adjusting the scaling we can assume approximation from within, 
without changing our asymptotic results. However, when R* is not star-shaped, 
slightly more is required. There are two ways to deal with such domains. First, 



Proposition 20 of |CEF| tells us that average height functions behave continuously 
when boundary heights are perturbed, and this lets one adjust regions so that their 
normalizations fit within the limiting region, without changing the asymptotics. 
Second, one can give a direct proof by cutting a general region into star-shaped 
pieces. Thus, we will be able to assume approximation from within in later sections 
without loss of generality. 

In the case of approximation from within, it will be convenient to use a slightly 
different definition of £-approximation. If R2 is in the interior of with boundary 
height functions hi and /12, then we say that (i?i, hi) is within e of (i?2, ^12) if their 
boundaries are within e, and for every height function h on Ri extending hi, the 
restriction of h to 9i?2 is within e of /12. This clearly generates the same topology 
as the definitions above, but it is more convenient to work with, so we will use it 
in later sections. 

We saw in the first paragraph of this subsection that if R approximates R* when 
rescaled, then normalized height functions on R approximate asymptotic height 
functions on R* (because of the Lipschitz condition on height functions). Later, 



Proposition 3.2 will tell us that every asymptotic height function is nearly equal to 
a normalized height function; that is, the class of asymptotic height functions has 
not been defined too broadly. 

We will show that the average height function on R is determined by finding the 
asymptotic height function / that maximizes the integral of local entropy, which 
depends on the tilt of /. 

Given (s,t) satisfying |s| -I- |i| < 2 (i.e., a possible tilt for an asymptotic height 
function), define the local entropy integrand ent(s,t) as follows. First define 
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Pc — ~ 7 + 77- cos ^ I "V"/ -.^-.y..-./ ~, I ^ 



where the values of cos ^ are taken from [0, tt]. We set a = sin(7rpa), ^ = sin(7rpi 



6 



c = sin(7rpc), and d = sin(7rpd) when \s\ + \t\ < 2, and we define ent(s, t) as in (1.2) 
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We will see later that there is good reason to believe that the numbers Pa,Pb,Pc,Pd 
correspond to the local densities of the four orientations of dominos. However, we 
do not have a proof. 

Define the entropy functional on AH(i?*, hb) by 

Suppose that Ri, R2, ■ ■ ■ is a sequence of simply-connected lattice regions with spec- 
ified boundary heights, such that when _R„ is normalized by n (or, say, a constant 
times n), as n — + cxd the boundary converges to dR* and the boundary heights to 
hb (as specified above). Without loss of generality, we can assume that the nor 



malized boundaries of i?2, • . • all lie within R* . We know from Proposition 2.4 



that there is a unique / e AH(i?*, hj,) that maximizes Ent(/). We will prove (in 



Theorem 4.3) that the entropy of tilings of i?„ whose normalized height functions 
are close to / (that is, the logarithm of the number of such tilings, divided by half 
the area of i?„) is Ent(/) -I- o(l) as n — > 00. 

We can now prove a sharpened version of the claim made in the second paragraph 



of Theorem 1.1 



Theorem 1.3. For each e > 0, the probability that a normalized random height 
function on Rn differs anywhere by more than e from the entropy-maximizing func- 
tion f is exponentially small in r? (i.e., is bounded above by r" for some r <1). 

Proof. Cover AH(i?*,/ib) with open sets around each asymptotic height function 
h, such that the entropy for normalized height functions in these sets is strictly 
less than Ent(/) unless h — f . By compactness, only finitely many of the sets are 
needed to cover AH(_R*,ft,h); we include the one corresponding to h = f. Then 
Theorem [4.3| implies that for n large, the probability that a random normalized 
height function on i?„ docs not lie in the open set around / is exponentially small 
inn^. □ 

Theorem RTS provides a much stronger large deviation estimate than the best 



previous result (Theorem 21 of |CEP]). In addition, it gives the first proof that 
under these conditions the normalized average height function of Rn converges to 
f as n 00. (It was previously not known to converge at all, nor was a precise 
characterization of the entropy-maximizing function known.) 

It is worth pointing out th at all o ur results generalize to tilings with unit lozenges 
(as studied in, for example, ||CLP| ). The combinatorial results for lozenge tilings 
(or, equivalently, the dimer model on a honeycomb graph) are straightforward mod- 
ifications of those we present here for domino tilings; the analytic results are special 
cases of ours (set one of the four weights a, b, c, d equal to to move from weighted 
domino tilings of tori to weighted lozenge tilings). 

In Sections || through |J, we will need the following three facts proved later in 
the article: 

1. For every tilt (s, t) satisfying \s\ \t\ < 2, there exist unique weights a, b, c, d 



(up to scahng) that satisfy ah = cd and give tilt (s,t) (see Section p.2\) . 

If an n X n torus has edge weights a, 6, c, d such that ab = cd and the tilt 

is (s,t), then the normalized entropy (for the probability distribution on the 



tilings) is ent(s,f) -I- o(l) as n — > 00 (Theorem 9.2) 



The local entropy function cnt(-, •) is strictly concave and continuous as a 



function of tilt (Theorem 10.1 
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2. Analytic Results on the Entropy Functional 

In this section, we will phrase all our results in terms of the local entropy in- 
tegrand, but they will depend only on its continuity and strict concavity. (Thus, 
if desired, the results can be applied in different circumstances, for example to 
other sorts of tiling problems.) Everything in this section is fairly standard ma- 
terial from geometric measure theory and the calculus of variations; for example, 



Theorem 5.1.5 of |Fe[ is a more sophisticated version of Lemma 2.1. However, we 
will give complete proofs, partly to make this part of the paper self-contained and 
accessible, and partly because some of what we need does not seem to appear in 
the literature in quite the form we would like. 

In what follows, hb denotes a particular boundary asymptotic height function and 
h ranges over the asymptotic height functions that restrict to hh on the boundary 
of the region R. 



Lemma 2.1. The functional Ent : AH(i?*,/if,) M, defined (as above) by 
"^"^^'^ ^ II (S' 



is upper semicontinuous. 



For the proof of Lemma 2.1, as well as other applications later in the paper, we 
will need to know how well we can approximate an asymptotic height function h by 
a piecewise linear function. The simplest way to achieve such an approximation is 
as follows. For i > 0, look at a mesh made up of equilateral triangles of side length 
i (which we call an i-mesh). Consider any piecewise linear, 2-Lipschitz function 
h that agrees with h at the vertices of the mesh and is linear on each triangle. 
(Of course, h depends on the mesh as well as on h. It is uniquely determined on 
each triangle that lies completely within R* , but not on those that extend over the 
boundary.) 

Lemma 2.2. Let h be an asymptotic height function, and let e > 0. If £ is suffi- 
ciently small then, on at least a 1 — s fraction of the triangles in the (.-mesh that 
intersect R* , we have the following two properties. First, the piecewise linear ap- 
proximation h agrees with h to within le. Second, for at least a 1 — e fraction (in 
measure) of the points x of the triangle, the tilt h'{x) exists and is within e ofh'{x). 

Keep in mind that h'{x) is the vector of partial derivatives of h at x; it does not 
matter which norm we use to measure the distance between tilts, but for the sake 
of specificity we choose the (.2 norm. Notice that the second property implies that 
Ent(/i) = Ent(/i) + o(l) (that is, Ent(/i) Ent(/i) as e 0). (Keep in mind that 
\\h'\\i<'2.) 

Proof. We begin with the first of the two properties. Recall that Lipschitz functions 
are differentiable almost everywhere (Rademacher's theorem) |Q. For any point 
X at which h is differentiable, we have \h{x -\- d) — {h{x) -\- h'{x) ■ d)\ < s\d\/2 if \d\ 
is sufficiently small, say \d\ < r with r > (where r depends on x). If x lies within 
an equilateral triangle of side length £ with £ < r, then on that triangle we have 
the approximation property we want, because there the two functions d h{x-\-d) 
and d I— *■ h(x -\- d) (the unique linear function that agrees with g on the corners of 
the triangle) both lie within et/2 oi d^ h{x) + h'{x) ■ d. 
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Given p > 0, let Sp be the set of all x such that r (depending on x as above) 
can be taken to be at least p. Take p small enough that the measure of Sp is at 
least (1 — e/3) times area(i?*). (We can do that since h is differentiable almost 
everywhere.) 

Now take i < p. Look at any ^-mesh, and the piecewise linear approximation 
h we get from it. If i is sufficiently small, then all but a o(l) fraction of the mesh 
triangles lie entirely within the region. At least a 1 — e/3 — o(l) fraction of them 
must intersect Sp, which proves that the desired approximation property holds on 
at least a 1 — e/3 — o(l) fraction of the triangles. 

For the second property, we will apply a result on metric density (see Section 7.12 
of 10). Let J7i, . . . , [/„ be open subsets covering the set of possible tilts such that 
any two tilts contained within the same subset differ by at most e, and for 1 < i < n 
let Vi — {x : h'{x) G Ui}. It follows from the theorem on metric density that (for 
each i) if S is sufficiently small, then for all but an e/3 fraction of the points x € Vi, 
at least a 1 — c fraction of the ball of radius S about x lies in Vi (and thus the tilt at 
those points differs by at most c from h'{x)). Now we can take 5 small enough that 
this result holds for all i from 1 to n, and then as above it follows that for t < 6, 
al — e/3 — o(l) fraction of the mesh triangles in any £-mesh lie entirely within R* 
and satisfy the second property. 

Thus, if I is sufficiently small, at least a 1 — e fraction of the triangles satisfy 
both properties (since al — e/3 — o(l) fraction satisfy each). □ 



Lemma 2.3. Suppose that dR* is an equilateral triangle of side length £, and that 
the asymptotic height function h satisfies \hf) — h\ < s£ on dR* , where h is some 
linear function. Then 

Ent{h) < Ent{h) + o(l) 

as e 0. 



Proof. Because ent(-) is concave, 

area(i?*) JJ \ dx' dy 



(2.1) ^— — ■ / / ent ( ^ ) dxdy < ent{h^^i,^, hy^^^). 



where 



and 



'^^^av = 11 ^dxdy 

area(it*) J J jj* ox 



We have 



(^a;,av; ^y,av) — (^a;,avi hy^^iv^ ~t~ 0(e) 

(as one can see by computing the average by integrating over cross sections and 
applying the fundamental theorem of calculus), and hence 



ent(/ia;^avi hy,av 

) = ent(ft,j;,av, hy^av) + o(l), 



with 



since ent(s,t) is a continuous function of (s,t). Now combining (2.1 

Ent(/i) = area(i?*)ent(/ia;,av, ^y,av) 
yields the desired result. □ 
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Proof of Lemma \2.^ . Let h be any asymptotic height function, and consider the 
neighborhood Us of h consisting of all asymptotic height functions within 6 of h. 
We need to show that given e > 0, if (5 is sufficiently small, then for all g ^ Us, 
Ent(g) < Ent(/i) +e. 



Let s' > 0. It follows from Lemma 2.2 that if £ is small enough, then the piecewise 
linear approximation h coming from an l-mesh satisfies |/i — < e'£ on all but an 
e' fraction of the triangles of the mesh, and Ent(/i) = Ent(ft,) + o(l) as e' 0. Let 
6 < e'l. Then if 5 G Us, \g — f \ < 2e'£ on all but an e' fraction of the triangles. 



and by Lemma 2.3 the contribution to Ent(g) from the non-cxccptional triangles 
is at most o(l) greater than the corresponding contribution to Ent(ft,). Of course, 
the remaining e' fraction of the triangles contribute a total of 0{e'). Hence, 

Ent(5) < (1 - 0(e'))Ent(/i) + o(l) + O(e') = Ent(/i) + o(l). 

By choosing e' sufficiently small, one can make the o(l) term less than e. Thus, 
Ent(-) is upper semicontinuous. □ 

Proposition 2.4. There is a unique asymptotic height function f G AH(i?*,ft,f,) 
that maximizes Ent(/). 

Proof. For existence, we can use a compactness argument, since AH(i?*,/if,) is 
compact. Because the local entropy integrand is bounded, Ent(-) is bounded above, 
and we can choose a sequence hi, h2, ■ ■ ■ such that Ent(/ii) approaches the least 
upper bound as i — s- 00. By compactness, there is a subsequence that converges, 
and by upper semicontinuity, the limit of the subsequence must have maximal 
entropy. 

Now uniqueness is easy. Suppose that /i and /2 are two asymptotic height 
functions that maximize entropy, with f\ ^ fi- 

Their derivatives cannot be equal almost everywhere, so for some e > we must 
have 



\ dx ' dy J 2 
on a set of positive measure, by the strict concavity of ent(-). It follows that 
Ent [il±Jl] > Ent(/i) + Ent(/2) 



which contradicts the assumption that Ent(/i) and Ent(/2) were maximal. (Notice 
that since /i and /2 are asymptotic height functions, so is (/i + /2)/2.) Therefore, 
only one asymptotic height function can maximize entropy. □ 

3. Combinatorial Results on Entropy 

In this section, we will compare the entropies of several regions with nearly the 
same shape, but slightly different boundary conditions. To deal with this sort of 
situation, we will use partial height functions. Let i? be a simply-connected lattice 
region. A partial height function on R is an integer-valued function h defined on a 
subset of the lattice points in R, including all the lattice points on the boundary, 
such that h satisfies the following condition. If u and v are adjacent lattice points 
on which h is defined, such that the edge from m to w has a black square on its left, 
then h{v) — h{u) is 1 or —3. We call h a complete height function if it is defined 
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on all the lattice points in R, and we say that a complete height function is an 
extension of a partial height function if they agree where both are defined. We call 
a partial height function a boundary height function if the set of lattice points on 
which it is defined is the boundary of R and there exists a complete height function 
extending it. 

Define a free tiling (or a "tiling with free boundary conditions" ) of a region as 
a covering of the region by dominos, no two of which overlap, and each of which 
contains at least one cell belonging to the region. A free tiling is just like a tiling, 
except that the tiles are allowed to cross the boundary. Given a boundary height 
function h, extensions of /i to i? correspond to tilings where dominos cross the 
boundary of R at certain specified places (namely the places where the height 
changes by ±3). In this way h singles out a certain subset of the free tilings of 
R. We could phrase all of our results in terms of free tilings, but partial height 
functions will be a more convenient formulation. 

Suppose we have a boundary height function on R, such that there is an extension 
to a complete height function on R. We can determine the maximal extension H 
(under the usual partial ordering) as follows. For any lattice point v in R, look at 
boundary points w and paths tt joining w to v such that every edge of tt (oriented 
from w towards v) has a black square on its left. Call such a path an increasing 
path, since it if does not cross any dominos, then the height increases by 1 after 
each edge. Define a decreasing path analogously. 

Lemma 3.1 (Fournier). For each lattice point v ^ R, define i?max('^) o-s the min- 
imum of h{w) + lcngth(7r), where w ranges over all boundary lattice points and tt 
ranges over all increasing paths in R from w to v, and define -ffmin('i') o-s the max- 
imum of h{w) — length(7r), where this time tt ranges over all decreasing paths in 
R from w to V. Then -ffmax is the maximal extension of h to R, and iJmin is the 
minimal extension. 



For a proof, see |Fo|. Notice that Lemma 3.1 implies that if one changes the 
boundary height function h by at most a constant c at each point, then iJmax and 
-ffmin change by at most c as well. 

We can now prove a proposition we will need later, that connects asymptotic 
height functions to actual height functions. Suppose i? is a domino-tileable lattice 
region, such that rescaling i? by a factor of 1/n gives a region whose boundary 
lies close to a region R* , where dR* is a piecewise smooth, simple closed curve. 
Without loss of generality, we can suppose that the normalized region lies within 



R* (as discussed in Subsection 1.6) 



Proposition 3.2. Given an asymptotic height function f which is within e of the 
normalized boundary heights of R, there is an actual height function on R whose 
normalization is within e + 0(l/n) of f . 

Proof. Let g be the largest height function on R whose normalization is less than 
or equal to /, i.e., the lattice sup of all height functions below /. (Technically, we 
must pick a lattice point and restrict our attention to height functions that give 
it height modulo 4. This ensures that all our height functions are equal modulo 
4 and thus that the lattice operations are well defined.) Then all values of g are 
within 8 of what one would get simply by un-normalizing /, because if one takes 
any lattice point, assigns it a value that is correct modulo 4 and at least 4 below 
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the un-normalized value of /, and looks at the minimal height function taking that 
value there, then the Lipschitz constraint on / implies that one stays below /. 

The height function g will typically not have the correct boundary values on R. 
Instead, it will have boundary heights corresponding to some different boundary 
height function &, although they will be within en + 0{1) of the actual boundary 
heights for R. 

To fix this problem, let h be the minimal height function on R, and let H be the 
maximal one. Then consider (g\/ h) AH, which is a height function with the correct 
boundary values for R. It follows from Lemma that h and H differ by only 
en + 0(1) from the minimal and maximal extensions h' and H' of b, respectively. 
Thus, since {g V h') A H' ^ g, it follows that {g V h) A H differs by only en + 0(1) 
from g. Because the normalization of g differs from / by only 0(l/n), we see that 
the normalization of {g\/h)AH differs from / by at most e + 0{l/n), as desired. □ 



Lemma 3.3. The minimal increasing path length (with the path not restricted to 
lie in any particular region) between two lattice points is always within 1 of twice 
the sup norm distance between them. 

Proof. Start at any lattice point and work outwards, labelling the other points with 
the lengths of the shortest increasing paths to them. It is easy to prove by induction 
that on the square at sup norm distance n from the starting point, on two opposite 
sides the lengths alternate between 2n and 2n + 1, and on the other two sides they 
alternate between 2n and 2n — 1. □ 

We will prove the next three results in more generality than we need, in order 
to make the precise hypotheses clear. We will need to apply them only to regions 
whose boundaries closely approximate a. k x k square or an equilateral triangle 
of side length k, such that the heights along the boundary are nearly planar (in 
particular are fit by a plane to within ek for some fixed e > 0). 

Proposition 3.4. Let e > 0. Suppose R is a simply- connected lattice region of 
diameter at most n (i.e., every lattice point in R is connected to every other by a 
path within R of length n or less ), such that the heights on the boundary of R are 
fit to within en by a plane with tilt {s,t) satisfying \s\ + \t\ < 2. Then the average 
height function is given to within 0{en) by that plane (if n is sufficiently large). 

Here "0(£n)" simply denotes a quantity bounded in absolute value by a fixed 
constant times en, for sufficiently large n. 

Proof. Consider a large torus, with edge weights a,b,c,d chosen to give tilt (s,t) 



and satisfying ab = cd (see Subsection 9^), and view R as being contained in 



the torus. We will look at random free tilings of R generated according to the 
probability distribution on weighted tilings of the torus. If we fix the height of 
one point on the boundary of R, then the average height function for these tilings 
is given (exactly) by a linear function of the two position coordinates, so that its 
graph is a plane. 

If we select a random tiling from this distribution, then with probability differing 
from 1 by an exponentially small amount, the heights along the border of the patch 



stay within en of the plane, by Proposition 22 of |CEP|. (It is not hard to check 



that all the large deviation results from Section 6 of |CEP|, such as Propositions 20 



and 22, apply to random tilings generated this way, not just from the uniform 
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measures on tilings of finite regions. All that matters is conditional uniformity, in 
the sense that two tilings agreeing everywhere except on a subregion are equally 
likely to occur; conditional uniformity follows from ab = cd.) Consider all possible 
boundary height functions on R that stay within en of a plane. The average of the 
corresponding average height functions, weighted by how likely they are to occur 
in the weighted probability distribution, is within o(l) of the plane (for n large). 

By Proposition 20 of JCEP , all boundary height functions that stay within en 
of the plane have average height functions within 0{en) of each other. Since the 
average is within o(l) of the plane, each must be within 0{en) of it. This completes 
the proof. □ 



Lemma 3.5. Let e > 0. Suppose R is a horizontally and vertically convex lattice 
region of area A with at most n rows and columns, such that n < eA. Assume that 
the boundary heights are fit to within eA/n by a plane with tilt (s, t). If \s\ + \t\ > 
2 — e, then the entropy of R is 0(elogl/£), if A is sufficiently large. 

Proof. Without loss of generality, suppose that s and t are positive. 

Consider any vertical line through R on which the x-coordinate is integral. If the 
segment that lies within R has length fc, then for every tiling of R, the difference 
between the number of north-going dominos bisected by the line and the number 
of south-going dominos bisected by it is -t- 0{eA/n) + 0(1), as one can see by 
considering the total height change along the segment. Similarly, on a horizontal 
segment of length fc, the number of west-going dominos bisected minus the number 
of east-going ones bisected is sfc/4 -I- 0{eA/n) + 0(1). If we add up all of these 
quantities, the error term becomes 0{eA) + 0{n), which is 0{eA) since n < eA. 

The total number of dominos in any tiling of R is A/2. By the results of the 
previous paragraph, this quantity is also twice the number of east-going or south- 
going dominos plus (s/4 -I- t/4)A -I- 0{eA). We have (s/4 + t/4)A > ((2 - e)/A)A. 
Hence, the total number of east-going or south-going dominos is 0{eA). 

Every tiling is determined by the locations of its east-going and south-going 
dominos. (To see why, recall that superimposing the matchings corresponding to 
two tilings gives a collection of cycles. Any disagreement between the two tilings 
yields a cycle of length at least 4, which must contain an east-going or south-going 
edge that is in one tiling but not the other.) Hence, there are at most 



tilings, since there are 0{eA) possibilities for the number of east-going or south- 
going dominos, at most 

A 

^0{eA)^ 



places to put them, and at most 



20{eA) 



ways to choose which are east-going. It is not hard to check that Stirling's formula 
implies that 

^°^(o(£A))-^«^l°Sl/^)^)- 
Therefore, the entropy of R is 0(elogl/£). □ 
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Proposition 3.6. Fix k > 0. Let e > 0, and let R be a horizontally and vertically 
convex region of area A with at most n rows and columns, such that kn^ < A and 
n < eA. Suppose h is a boundary height function on R that is fit to within eA/n 
by a fixed plane. Then for A sufficiently large and e sufficiently small, the entropy 
of extensions of h to R is independent of the precise boundary conditions, up to an 
error of 0{s^^'^ log 1/e). (The entropy does depend on the tilt of the plane, and this 
proposition says nothing about whether it depends on the shape of R. Notice also 
that the dependence on k is hidden within the implicit constant in the big-O term.) 

Proof. Let (s, t) be the tilt of the plane. If \s\ + \t\ > 2 — e^/^, then the conclusion 



follows from Lemma 3.5. Thus, we can assume that the tilt satisfies |s| + |t| < 

2- £1/2. 

Suppose g is another boundary height function on R, which agrees with the same 
plane as h, to within 0{eA/n). We need to show that extensions of g and h have 
nearly the same entropy. Without loss of generality we can assume that g > h, 
since otherwise we can go from g to g \/ h and from h to g \/ h. 

Given any extension / of g, let H{f) be the infimum of / and the maximal 
extension of h, so that H{f) is an extension of h. Similarly, given any extension / 
of h, let G{f) be the supremum of / and the minimal extension of g, so that G(/) 
is an extension of g. 

The maps H and G are not inverses of each other, but they come fairly close to 
being inverses. Given an extension / of h, H{G{f)) agrees with / at every lattice 
point except those with heights less than or equal to their heights in the minimal 



extension of g. By Lemma 3.1, the minimal extension of g is within 0{eA/n) of 
the minimal extension of h, so these points have heights within 0(eA/n) of their 
minimal heights. Similarly, given an extension / of g, G{H{f)) agrees with / at all 
lattice points that are not within 0{sA/n) of their maximal heights. 

By assumption, the tilt (s,t) of the plane satisfies \s\ + |t| < 2 — e^^^. Thus, the 
height difference between any two points in the plane is bounded by 2 — gi/^ times 



the sup norm distance between them. Therefore, Lemma 3A and Lemma 3^ imply 
that the extreme heights at a point differ from the heights on the plane by at least 
ffi/^ times the sup norm distance to the boundary. 

Look at all lattice points not within sup norm distance {e^^^ log l/e)A/n of the 



boundary. By Proposition 3.4, at any such point the average height for extensions 
of 5 or /i is within 0{eA/n) of the plane (notice that because fcn^ < A < n'^, being 
0{en) is the same as being 0{eA/n)), and the extreme heights differ from the plane 
by at least (e log l/e)A/n, so the extreme heights differ from the average heights by 



an amount on the order of {elogl/e)A/n, for e small. By Proposition 22 of | |CEP |, 



the probability that any such point will have height within 0{eA/n) of its extreme 
heights is exponentially small in n. 

Thus, given any point not within sup norm distance (£i/^ log l/e)v4/n of the 
boundary, the probability that H o G or G o H will not be the identity at that 
point is exponentially small. It follows that with probability nearly 1, H o G and 
G o H are the identity except on at most 0((ei/^ log 1/e) A) lattice points. Thus, 
the numbers of extensions of g and h differ by at most a factor of 

40((ei/^logl/e)A)^ 

SO the entropy of extensions of to i? differs from that of extensions of h by 
0(£i/2logl/e). □ 
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4. Proof of the Variational Principle 

Theorem 4.1. Let e > 0. Suppose R is an n x n square, with a boundary height 
function h fit to within en by a plane with tilt {s,t) satisfying \s\ + \t\ < 2. Then 
for n sufficiently large, the entropy of extensions of h to R is 

ent(s,t) + 0(ei/2 logl/e), 

as is the entropy for free boundary conditions staying within en of the plane (i.e., 
the entropy for the set of all free tilings of R whose boundary heights stay within 
en of the plane). 



Proof. We know from Proposition |3^ that the entropy is independent of the precise 
boundary conditions, but we still need to prove that it equals ent(s, t). To do so, we 
will compare with an n x n torus that has edge weights a, b, c, d satisfying ab — cd 
and yielding tilt {s,t). (We can suppose that |s| + |i| < 2, since otherwise the result 



follows from Lemma |3.5| and Proposition 3.6.) The torus is obtained by identifying 



opposite sides of R, so that tilings of R give tilings of the torus, but not vice versa. 
Keep in mind that because of the weighted edges in the torus, the probability 
distribution on its tilings will not be uniform. However, the equation ab = cd 
implies conditional uniformity (as mentioned earlier), so if we fix the behavior on 
the boundary of R, then the conditional distribution on extensions to the interior 
will be uniform. 



In Section H, we will define a set C N depending on (s,t). By Lemma 3.1 



for sufficiently large even n, either n or n + 2 is in W. In Proposition 9.1, we show 
that the entropy of n x n tori with tilt (s, t) converges to ent(s, i) as n — > oo in W. 

First we will suppose that n e W, so that the entropy of the n x n torus is 
ent(s,t) + o(l). 



It follows from Proposition 22 of |CEP| that with probability exponentially close 



to 1, in a random tiling of the torus, the heights on the boundary of the square 
will be fit to within en by the average height function, which is a linear function 
with tilt (s,i). Th e nu mber of toroidal boundary conditions is exponential in n. 



and by Proposition 3.6 each has about the same entropy (except ones that are not 
nearly planar, but they are very unlikely to appear). Lemma |l.2| tells us that the 
entropy of the torus equals the average of the entropies for the different boundary 
conditions, plus a negligible quantity for large n (since we have normalized by 
dividing by half of the area n^). Because all the nearly planar boundary conditions 
have the same entropy, the torus must as well, so since we know it has entropy 
ent(s, t) +o(l) as n oo, each of the nearly planar boundary conditions must have 
entropy ent(s,t) + 0(e^/^ logl/e). (Since e is fixed as n oo, we can absorb the 
o(l) into the big-O term.) 

Now it is easy to deal with the case oi n ^ W. If n ^ W, then n + 2 E W and 
71 — 2 g W. The entropies for tihngs of {n — 2) x {n~ 2), n x n, and (n + 2) x {n + 2) 
squares with nearly planar boundary conditions are nearly the same. (To prove 
that, embed an (ti — 2) x (n — 2) square into an n x n one, and an n x n one 
into an (n + 2) x (n + 2) one, extending the boundary conditions arbitrarily. Then 
the number of tilings increases with each embedding, so the entropy of the n x n 
square is caught between the other two, to within a 1 + o(l) factor coming from the 



differing areas. By Proposition 3.6, this result holds for all nearly planar boundary 
conditions.) Note that the same argument as above goes back from squares to the 
torus, thus proving entropy convergence for all n (not just n G W). 
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The claim about free boundary conditions follows easily (since the number of 
boundary conditions that stay within en of the plane is only exponential in n, and 
all of them have about the same entropy). □ 



Corollary 4.2. Theorem j| holds if R is an equilateral triangle of side length n, 
instead of a square. 

Proof. It is not hard to check that equilateral triangles satisfy the hypotheses of 



Proposition 3.4, Lemma 3.5, and Proposition p.6\ Thus, we just need to deal with 
the case of a equilateral triangle whose boundary heights are within 0(1) of being 
planar. 

To prove that the entropy of the triangle is at least what we expect, tile the 
triangle with smaller squares, such that their boundary heights are within 0(1) of 
being planar. (Of course, those near the edges will stick out over the boundary, 
but if n is large enough, we can make the squares small enough compared to the 
triangle that only an e fraction of the squares will cross the boundary.) Except 
for an error of 0(e) from the squares that cross the boundary, the entropy of the 
triangle is at least the entropy of the squares, which is what we wanted. 

An analogous argument (involving tiling a square with triangles) shows that the 
entropy of the triangle is at most what we expect. □ 

For the next theorem, we will use the same setup as in Proposition ^.2| . 

Theorem 4.3. Let R* be the region bounded by a simple closed curve, and let ht be 
a boundary height function on R* . Suppose R is a simply- connected lattice region 
such that when R is normalized by a factor of 1 /n, it approximates R* to within 
6, and its normalized boundary heights approximate the region R* with boundary 
heights hb to within S; we assume that the normalization of R lies within R* . Given 
an asymptotic height function h G AH(i?*, hh), the logarithm of the number of tilings 
of R whose normalized height functions are within 0{d) of h is the area of R times 

Ent(/i) +o(l) 

as 5 (for n sufficiently large). 

Proof. Notice that the set of tilings whose normalized height functions are within 



0{6) of h is non-empty, by Proposition 3.2. Call the set of such tilings Us 



Fix £ > 0. Choose £ small enough that we can apply Lemma 2.2 to the piecewise 
linear approximation h to h derived from an ^-mesh (with approximation tolerance 
e). Then, as is pointed out after the statement of Lemma |]^, Ent(/i) — Ent(/i) + 
o(l). We will take S < £e, and show that the entropy we want to compute is 
Ent(/i) +o(l). 

We know that \h — h\ < ie on all but at most an e fraction of the triangles in 
the mesh. Those triangles can change the entropy by only 0{e), so we can ignore 
them. We can also ignore the 0{5) fraction of the triangles that do not lie within 
the normalization of R (which change the entropy by 0{S) = 0{e)). We will call 
the triangles within the normalization of R on which |/i — /i| < £e the included 
triangles, and the others the excluded triangles. 

Let g be any element of Us . The entropy of Us is bounded below by the sum over 
all included triangles of the entropy of g restricted to that triangle, plus the 0{e) 
contribution from the excluded triangles. It is bounded above by the same sum 
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(including the 0{e)), but with free boundary conditions on the included triangles 
(subject to the condition of staying within ie of /). 



We can now apply Corollary 4.2, It tells us that each included triangle's contri- 
bution to the entropy of Us is approximately equal to its contribution towards the 
entropy of h. It follows that our upper and lower bounds for the entropy of Us both 
equal Ent{h) + 0{e) + ©(e^/^ log 1/e). This gives us the desired conclusion. □ 



Note that Theorem 4.3 implies Theorem 



1.1 



5. Overview of Remaining Sections 

In Section |^ we compute the partition function Z„(a, 6, c, d) for matchings of 
the toroidal graph Gn — 1? jlnl? with Ar? vertices. In Section |7.2| we compute 

the limit, as n — > oo, of Z^^*-^" In Section ^ we compute the limit of the edge- 
inclusion probabilities for edges of each type, with respect to the measures /Lt„, and 
also a bound on the variance of the number of edges of a fixed type in G„. This 
computation is only done for n in an infinite subset C N. Since the variance is 
o(n*), the measure is concentrating near tilings with the mean number of edges of 
each type. This fact allows us, in Section ^, to compute the limit for n oi the 



entropies. As explained in the proof of Theorem 4.1, it follows that the limit for 
arbitrary n — > oo must be the same as the limit for n € W . In Section |l^ we show 
that the entropy is strictly concave as a function of the tilt (s, t). In Section |l^ we 
present the PDE which a entropy-maximizing Lipschitz function must satisfy 
(at least in the distributional sense). 



6. The Partition Function 

Let the graph G be the infinite square grid. Define an a-edge to be a horizontal 
edge whose left vertex has even coordinate sum, a 5-edge to be a horizontal edge 
whose left vertex has odd coordinate sum, a c-edge to be a vertical edge whose lower 
vertex has even coordinate sum, and a d-edge to be a vertical edge whose lower 
vertex has odd coordinate sum. Let a, 6, c, d be four non-negative real numbers. 
Weight the a-edges with weight a, the 6-edges with weight 6, and so on. (For 
comparison with our earlier, more geometrical terminology, a-edges are north-going, 
&-edges south-going, c-edges east-going, and d-edges west-going; in other words, 
points with even coordinate sum correspond to white squares.) 

We assume without loss of generality that a > b, c > d and a> c. 

For n an even positive integer let Gn denote the quotient of G by the action 
of translation by (2n,0) and (0, 2n). Then Gn is a graph on the torus and has 
vertices and edges (2ri^ edges of each type). A vertex (a, 5) of G (where 
a,b G [0, 2n — 1]) will be denoted in what follows not by an ordered pair (a, b) but 
rather by an ordered triple (x, y, i), where x = [a/2j , y = \ b/2\ , and t = 1,2, 3, or 
4 corresponding to (a, b) being congruent to (0, 0), (1, 1), (1, 0), or (0, 1) modulo 2. 

The partition function Z„ is by definition the sum, over all perfect matchings of 
Gn, of the product of the edge weights in the matching: 

Zn{a,b,c,d)^ J2 a'^°6'^^c^^d'^^ 

matchings 

where Na is the number of matched a-edges, etc. 
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There is a natural probability measure /i„ = /in (a, b, c, d) on the set of all match- 
ings, where the probability of a matching that has Na a-edges, etc., is 

The physical interpretation of /i is the following. Let Ea, E^, Ec, and Ed be energies 
associated with "dimers" on an a-edge, 6-edge, c-edge, and c?-edge, respectively. De- 
fine weights a, 6, c, d (also called "activities" in the statistical mechanics literature) 

by 

where /? is a constant depending on the temperature. Then fi is the Boltzmann 
measure associated to these energies; specifically, the probability of a configuration 
of energy E is proportional to e~^^ , where E is the sum of the energies of the 
individual dimers. 

In what follows we will take a, 6, c, d as the fundamental quantities and will not 
deal with f3 or temperature as such. 

Our concern will be with the situation in which n goes to infinity with the field- 
parameters a,b,c,d fixed: the so-called "thermodynamic limit". Even though for 
each finite n, Zn{a,b, c, d) is a smooth function (indeed a polynomial function) of 

the 4-tuple (a, 6, c, d), the limit Z(a, 6, c, d) — lim„^oo Zn^^'^^ •* and other thermody- 
namic quantities need not be. We will see that Z{a, b, c, d) is everywhere but not 
in the vicinity of the locus of (a-|-5-|-c— d)(a+6— c+(i)(a— 6+c+d)(— a+ft+c+d) = 

0. 

It's worth pointing out that the 4-parameter field determined by (a, 6, c, d) actu- 
ally only has two meaningful degrees of freedom: one degree of freedom drops out 
because of the imposition of the constraint ab = cd, and the other drops out by 
virtue of the fact that multiplying a, b, c, d by a constant has no effect on any of the 
quantities of interest. These two degrees of freedom correspond to the two degrees 
of freedom associated with (s, t) (the tilt). 

7. Determinants 



Th e go al of this section is to compute Z„: we use Proposition 7.1 and equations 



([7j)-(|7^) below. 

Given an enumeration of the 4n^ vertices of Gn, we define the weighted adjacency 
matrix of Gn as the 4n^ x 4n^ matrix whose «,jth entry is the weight of the edge 
connecting vertex i to vertex j (interpreted as if there is no such edge). Define a 
matrix Ai by multiplying the weights on vertical edges of the weighted adjacency 
matrix of Gn by i = The matrix A2 is obtained from Ai by multiplying 

by —1 the weights on the vertical edges from vertices {j,n — 1,4) to (j, 0,1) and 
edges from vertices {j,n — 1,2) to (j,0,3) for all j S [0, n — 1]. The matrix A3 
is obtained from Ai by multiplying by —1 the weights on horizontal edges from 
vertices {n — 1, fc, 2) to (0, k, 4) and horizontal edges from vertices {n — 1, k, 3) to 
(0, fc, 1), for fc S [0, n — 1]. The matrix A4 is obtained from Ai by multiplying by 
1 the weights on both these sets of edges. By the method of Kasteleyn Kal| , we 



have the following proposition. 

Proposition 7.1. Fori = 1,2,3,4 the quantities det A,; are non-negative, satisfy 
Zn > V dot Ai , and satisfy 

Zn{a,b,c,d) = i (^-^/dctAi + \/ det Ai + ^det A3 + ^det . 
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Let V denote the set of vertices of G„. The matrix Ai operates on in the 
following way: for / e and w £ V, 



^vw fv 



Let Tf^j^k) be the linear operator on corresponding to the translation by (j, k) 
on Gn- The operators T^2.o) 8'iid 2^(0,2) commute with each other and with Ai. The 
eigenvalues of T(^2.o) are e^'^'-'/" for integers j e [0, n — 1] . The eigenspace of T(^2.o) for 
eigenvalue e^'^*-'/" is 4n-dimensional: a vector v in this eigenspace is determined by 
its coordinates in two consecutive columns of Gn- Similarly r(o,2) has eigenvalues 
^2-n-ik/n vcctor in the e^'^*'^/"-eigenspace is determined by its coordinates in 

two consecutive rows of Gn- The intersection of a maximal eigenspace of T(^2,o) 
and one of ^(0,2) is 4-dimensional: a vector in the intersection is determined by its 
coordinates on a 2 x 2 square of vertices (as in Figure ^, vertices vi,V2,V3, v^j 
Let z = e^'^*/". For ( 7, k) e [0, n 



1]^ and s £ {1, 2, 3, 4} define a vector Cj'^l by 



(7.1) 



if t — s, and 
otherwise. 




The e^"^ are in the intersection of the 2;^ -eigenspace of 7(2,0) and the z'^-eigenspace 
of 7(0,2)- Let 5 be the 4n^ x 4n^ matrix whose columns are these eigenvectors: 



(7.2) 



' "=-0.0' "^1.0' ■ 



Note that S satisfies S ^ = ^S'*, where * denotes the transpose. 

Because both 7(2,0) and 7(0.2) commute with Ai, S~^AiS has the block-diagonal 



form 



(7.3) 



S-'^AiS^ 



( ^0.0 





V 





^1,0 




Bn 



^-1 / 



with 4x4 blocks Bj,k for j, k £ [0, n — 1] . The block Bj,k is the action of A\ on the 
intersection of the z-' -eigenspace of 7(2_o) and the z'^-eigenspace of 7(o^2)- We have 



(7.4) 



B 



'3,k 





i{d + cz'') 

a + bz^ i{d + cz~^) 
\ i{c + dz^) b + az-^ 



a + bz ^ i{c + dz *=) \ 



\- az-' 





for the ordering {e^^^I, . . . , e^^],} (see Figure ^. 

Since the upper right and lower left 2-by-2 subdeterminants of Bj^k are complex 
conjugates, the determinant of Ai is 



det ^1 



W \2ab + a^z-3 



b^z^ + 2cd + c^z-^ 



d^zH 



j=0 k=0 



26 



HENRY COHN, RICHARD KENYON, AND JAMES PROPP 



Z ■'V2 



\ t 

- Vi b ^2 a — z'vi 



2"^W3 —h ""1 a "^3 b — z^vi 

I I 

Z ''V^ Z '^V2 

Figure 3. A vector v in the intersection of the z^-eigenspace of 
T(2^o) 8.nd the z'^-eigenspace of T(^o,2)- 



so 



(7.5) 



det^i 



nn 

j=0 k=0 



k\2 



{a + bz^y [c + dz^) 



z^ 



The matrices A2, A^, A4 have similar determinants. For example for A2, let 
i?(o,2) E^ct on by translation by (0, 2) followed by negation of the coordinates in 
the first two rows, that is, vertices (j, 0, s) for < j < n — 1 and s S {1,2,3,4}. 
Then i?"^ 2) = i?(o,2) ^^'^ -^(2,0) commute with A2. One thus finds that 



71 — 1 71 — 1 



(7.6) det(A2) = n n 

and similarly 



(a + 6e2"^/")2 (c + de'^'^(2fe+i)/n-)2 



p27rij/ri 



o7ri(2fe+l)/n 



(7.7) det(A3) 



(7.8) det(A4) 



nn 

j=Q k=0 



(a + 6e^'(2i+l)/„)2 ^ ^g2^»fe/n-)2 



g7ri(2j + l)/n 



^2-Kik/n 



nn + ■ (c+de"'(2fe+i)/„)2 

j=0 fe=0 



g7ri(2j + l)/n 



oiri(2fe+l)/n 



(7.9) 

(7.10) 

(7.11) 



P2 = 



Pi = 



We show that, with the exception of Ai, the square roots of det are polynomials 
in a, 6, c, d. Define 

j=0 fc=0 

n_lri_l , i,„TTi(2^ + l)/n\2 („ , ^„27rife/n\2- 

j=0 k=0 ^ 

, ;,„7ri(2.7 + l)/n\2 , ^„7ri{2fe+l)/n\2 - 

nn 



(a + 5e'^'(2J+i)/«)2 (c + de^'^*'^/")^ 



g7rt(2j + l)/n 



a7ri(2fe+l)/Ti 



The functions P2,P3 and P4 are polynomials in a,b,c,d. Note that in ( [7.9[ ), the 
involution (j, fc) 1-^ (— j, — fc + 1) maps each term to its complex conjugate (and this 
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involution has no fixed point). Therefore P2 = \J det Ai . Similarly P3 — \/ det 
and P4 = Vdet A^. 
Define 

where the + sign holds when a < 6 + c + d and the — sign holds when a > 6 + c + d. 
The involution (j, fc) i-^ (— j, — fc) maps each term in (7.12) to its complex conjugate; 
the product of the terms which are fixed under this involution is 

((a + hf + (c + df){~{a - hf + (c + df)(ia + hf - (c - df){{a - hf + (c - d)^). 

This product is positive or negative depending on whether a<5 + c + (i or a> 
& + C + C?. Thus on the domain a < b + c + d, Pi is a polynomial (taking non- negative 
values) , and on the domain a > 5 + c + d, Pi is the negative of this polynomial 
(and this polynomial also takes non-negative values). Note that Pi = \/det A\. 
(The quantity Pi is defined similarly for all positive values of a, 6, c, d; when one of 
a,b,c,d is greater than the sum of the other three, the — sign is used, otherwise 
the + sign is used.) 

From Proposition 7.1 we have 

(7.13) Z„ = i(-Pi + P2 + P3 + Pi). 

7.1. Eigenvalues and roots. Here we study in greater detail the function 

(a + bz)'^ (c + dw)'^ 
z w 

where a, b, c, d are non-negative reals. 
Let 



/ \ , (a + bzY 
r{z) =cd+ ' 



Then 

(7.14) q{z,w) 
where 



2z 

-I- 2r(z)u; -f d^iy^ {dw ~ ac){dw ~ Pc) 



w w 



(7.15) a{z),P{z) = ^±^(^^^ -1. 

We will choose (3{z) to be the larger root (in modulus). 

Let ^(z) = {a + bzY /z and ri{w) = —{c + dw)'^/w, so that £_{z) — ri{w) = q{z, w). 
The critical points oi £^{z),ri{w) are respectively z = zLa/b and w = zLc/d. Recall 
our assumption that a > b and c> d. 

Lemma 7.2. The map ^ maps the punctured unit disk {z : |z| < 1, z 7^ 0} 
injectively onto the exterior of the ellipse Ei whose major axis has endpoints — (a — 
&) ^ , (a -f 6) ^ and whose minor axis has endpoints 2ab -\- i(a^ — 6^),2a6— i(a^ — b^). 
The ellipse Ei has foci and Aab and center 2ab. In the case a — b, this ellipse 
degenerates to the line segment [0,4a6]. The map rj maps the punctured unit disk 
{\w\ < 1, w 7^ 0} injectively onto the exterior of the ellipse E2 with major axis 
— (c + d)^, (c — d)^, minor axis —2cd ± i(c^ — d^) , and foci at Q and —Acd. 

See Figures |and|. 
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Figure 4. The ellipses Ei and E2 in the case a <b + c + d. 




Figure 5. The ellipses Ei and E2 in the case a > b + c + d. 



Proof. Recall that a > 6 by hypothesis. Suppose a > b. We have ^{z) = a? j z + 
b'^z + 2ab. For |2| = 1 write z = cost + isint. Then 

^{z) = a'^{cost — isint) + b"^ {cost + isint) + 2ab 
= 2a6+(a^ + 6^)cost-i(a^-6^)sint. 

Thus as z runs counterclockwise around S^, ^(z) runs clockwise aroiind the ellipse 
El. Since ^ has no critical points in the unit disk, it is injective on the unit disk, 
mapping it to the exterior of Ei (with mapping to 00). 

When a = b, El degenerates to a segment; still, ^ maps the punctured open unit 
disk injectively onto the exterior of the segment. 

The case of rj is similar. □ 
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The function q can also be written 
(7.16) ^-(^,+bV-^ + ^(^ + dV^))(^^+tV-^-^(^ 



The coefBcients of a, b, c, d in either term of (7.1(;) have modulus 1 when |z| = 1 = 
Suppose a> b + c + d. Then g(e^^ e*"^) ^ for aU 9 and 0, so n E2 = 0, 
and hence E2 is contained inside the bounded region delimited by Ei. See Figure 
^. For each fixed z satisfying \z\ — 1, ^(z) is on Ei and so by Lemma 7.2 the roots 
w of = q{z,w) — ^(z) — r]{w) are situated one outside and one inside the unit 
disk. Since /3 is the larger of a and (3, a{z)c/d is the root inside the disk. 

On the other hand ifa< b + c + d but not both a — b and c = d, then (a — 6)^ < 
(c + d)^ (and (c — d)^ < (a + 6)^ by the hypothesis that a is the largest of a, b, c, d) 
so the two ellipses intersect as in Figure § (because the places where they cross the 
X-axis are interlaced). Let {zo,wo) = (e*^",e*'^") and {z^,m) = (e"'^°, e"*"^") be 
the roots (satisfying \z\ = \w\ = 1) of q{z,w) = 0, where 9o € (0,7r) (the angle 6 
cannot be or tt since, as we noted, the ellipses are interlaced on the x-axis). Again 
by Lemma |7.2| , for each z with \z\ = 1, exactly one of the roots w — ac/d, l3c/d is 
inside the (closed) unit disk when —6q < 9 < Oq, and for 6 ^ [—0o,9o], both roots 
are outside. We will again take a{z)c/d to be the smaller root. 

In the case a = b+c+d, the two ellipses are tangent, and their single intersection 
point is at z = — l,w = 1, that is, (6*0,00) — (""jO)- 

In the case when both a = b and c = d, the two degenerate ellipses intersect only 
when z = w = — 1, so that the single intersection point is at (^Oi (f'o) = i'^i '"')• 

An important fact about the above four cases is that \P(z)c/d\ > 1 always, and 
\a{z)c/d\ > 1 unless 9 e [-9o,9o]. 

Going back to the case a < b + c + d, at the point (9q, (po), the four quantities 



—=,b^/z^, ——,idy/w^ 



sum to zero by ( 7.16| ), assuming we choose the correct signs for and y/wo- 
They therefore form the edge vectors of a quadrilateral. When taken in the order 
as in Figure ^, the quadrilateral is in fact cyclic since opposite angles sum to tt: 
The (interior) angle between sides aj ^fz^ and ic/ ^/wq is 

TT - arg —— — 

and the (interior) angle between sides b^ 

^id^/w^, 

Summing these two gives angle tt (modulo 27r, of course). 
7.2. The limit of the partition functions. 
Theorem 7.3. The limit 

Z^Z{a,b,c,d) = lim Z^f^"') 

n — >oc 

exists, and 



arg 




2tt 



log = -i^ / / log 



(a + 6e*«)2 {c + de"^f 



^d9. 
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TT - <pQ 




Figure 6. The cyclic quadrilateral formed at (0o,0o) 



Kasteleyn [ Kal computed Z in the special case a — b and c = d. 



Combined with Proposition 7.1, this gives 



Proof. From Proposition 7.1, for positive reals a,b,c,d we have Pg < Z for all £. 



max Pi{a,b, c, d) < Zn{a,b, c, d) < — maxP^(a, 6, c, c?). 



This implies that 



lim = lim (maxP£)i/(2n= 



(assuming these limits exist). Here note that the j for which Pj = max^ may 
also depend on n. 

Let / denote the integral in the statement of the theorem. We show that 
(2n^)~^ logmax^ Pg converges to /. 
Let 

(7.17) F{d, (f>) = g(e'^ e"^) = a^e"*'' + 2ab + b^e'" + c^e"** + 2cd + d^e'"^. 
We then have 

P^=±\{F 

3,k 



27rj 2-Kk 
n ' n 



and similar expressions for P2, P3, and P4 (see ([7.9D-(7.12)). The quantity n ^ log(P£) 
is a Riemann sum for the integral of log|P|. The function log|P| is continuous, 
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hence Riemann integrable, on the complement of a small neighborhood of its two 
(possible) singularities (^Oi^o) and (— ^Oj— 0o)i defined in Section 7.1. 

We break the proof into four cases: the case a > b + c + d, the case a < b + c + d 
but not both a = b and c ^ d, the case a = b and c — d, and finally the case 
a = b + c + d. 

li a > b + c + d, then log |F| is continuous everywhere on [0, 27r] x [0, 27r] and so 
the Riemann sums converge to /. 

In the case a < b + c+d but not both a = b and c = d, there are two singularities. 
It suffices to show that the Riemann sums logPf for each £ are small on a small 
neighborhood of the singularities. This is not quite true since for some i the product 
Pe may have a factor in which {6, (/>) lands close to a singularity; as a consequence 
this Pi may be very small. However, we will show that this can happen for at most 
one of the four products Pe- 

For each term in the four products ( [7.9| )-( 7A^ ), {9, 4>) is of the form (jV/n, k'lT/n), 
for integers j', k' . Furthermore for each pair of integers (j', fc'), exactly one of the 
four products has a term with (6', (/)) = {j'-K /n^k'n /n). Thus at most one of the 
four products has a term closer than 7r/(2n) to the singularity [Oq, cjjo). The same 
Pi will have the term closest to the other singularity (— ^o, ^^o)- 

Fix a small constant i5 > and let Us C [0, 27r] x [0, 27r] be the S x (5-neighborhood 
of a singularity. In Us we use the Taylor expansion 



(7.18) 



and by Lemma [7^ below, the ratio of Fg{9o, 4>o) and F^{9o, 4>o) is not real. 
The sum of those terms in logP^ for which {9, <f) G Us is 



4 E iogim'/')i-A 



E log\C,i9 - 9o) + C2{^ - M\ 



for constants Ci = Fg{9o,4'o), C2 — F^{9o, 0o) with C1/C2 ^ M. (We used here the 
fact that log(a; + 0{x'^)) = log(a:) + 0{x) for small x; we then could take the big-0 
term out of the summation because of the 1/n^ factor.) 

To bound this sum, note that \Cix + C2y \ > C3|a:| + C4|y| > C^lx + iy] for some 
positive constants C3 , C4 , C5 since C1/C2 ^ K. We use polar coordinates around 
the singularity. In the annulus around (0o,0o) of inner radius K/{2n) and outer 
radius {K + l)/(2n), there are at most constant • K points {9, (f) which contribute 
to the sum, and each such point contributes < |n~^ log(constant • K/{2n))\ to the 
sum. Therefore the sum on Us (for those Pj without terms within 7r/(2n) of the 
singularity) is bounded by 



constant 



l<K<5n 



K 



1 ^ 
log — 

n 



= 0((52 log (5). 



For the Pj which does have a term closer than 7r/(2n) to the singularity, the 
above calculations give an upper bound on (2n^)~^ log-Pj. (Including in the factor 
close to the singularity only decreases the product.) Thus we have shown that 
(2n^)~^ logmax^ converges to /. This proves the convergence of (2n^)~^logZ 
to /. 
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In the case a = b and c = d we have only one singularity (tt, tt), and Pi — 0. For 
the other P^'s, when (6*, (/>) is close to (tt, tt) we have 

F{e,(l)) = a2(2 + 2cos0) + c2(2 + 2cos(?i) 

= a^{e - + c^[c^ - + 0{{e - ^Y, {(j) - nf). 

An argument similar to the previous case holds: on Ug we have 



(0,<t>)eUs 



J2 \og\a^i9-7rf +c'^i(l)-nf 

Now log |a^x^ + c^y^l < log \C{x'^ + Summing over annuli as before gives the 
bound. 

Finally, suppose a = b + c + d. For any S > we have 

Zn{a + S, 6, c, d) > Zn{a, b, c, d) > Zn(a — 5, 6, c, d) 

(recall that coefficients of the polynomial Z„ are non- negative) . For fixed 5, the 
limits 

lim Z„(a±,5,6,c,d)i/(2n^) 



both exist, and (as we shall see in (7.20) below) converge to the same value as 
5-^0. Thus 

lim Z„(a,6,c,d)i/(2"') 



exists and converges to this same value. This completes the proof. 



□ 



The integral in Theorem 7.3 can be written more usefully as follows. As before 



set 



r = r[z) — cd + 



{a + bzf 



We can evaluate the first integral (the integral with respect to (j)) in / as follows. 
We have (with w — e^'^) 



1 



log 



+ 2rw + d^w'^ 



d4) 



2^ 7o 



log I {dw — ac) {dw — (3c) \ 



where a,/3 are chosen as in Subsection 7.1 
Using the identity 



1 



log |t + se 



i4>\ 



\og\t\ if |t| > |s|, and 
log|s| if|s|>|i| 



(note that the logarithmic singularity in the case s = t makes no contribution to 
the integral), wc find (with z = e**) 



47r log Z = 



log ddO 



cP(z)\<d 



\ca{z) I <d 

log dde 



log |cQ!(z)| d9 



ca(z) \>d 

log|c/3(z)| de. 



\cf3{z)\>d 



From Lemma [7^ we know that c|/3(2:)| > d for all z on the unit circle. If 
a > b + c + d, then |ca(z)| < d for all \z\ — 1 and so 



Att log Z — 2tt log d + 2tt log c 



log\m\d0 
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(7.19) 



log ^ = ^ log + ^ log c + ^ 



log\(3{z)\de. 



If a < 6 + c + then recall |cq;(2;)| < d if and only if ^ G (— ^o,^o)- Thus we have 
47rlogZ= / \ogdde+ / log|cQ;(z)| ^6*+ / \og\cf3{z)\ d9, 



which gives (using af3 =1) 
(7.20) logZ = |^logd 



iog\p{z)\de. 



Comparing (7.19) and ( [7.20 ), we see that (7.20) holds for all a, 6, c, d, as long as 
we define 9q — n when a > b + c + d. 



Lemma 7.4. For the function F of { 7.11 }, the ratio Fe{9o, (po) / F^{9q, (po) ^ 
unless a = b + c-\- d or both a = b and c ^ d. 



Proof. We must show 
(7.21) 



_Q2g-ieo _|_ 52gieo 



For this proof only, let e*^ and e**^ be square roots of e'^" , e"^" , respectively, with 
signs chosen so that 



(7.22) 



-ie 



+ be'^ +i{ce-'^ + de'^) = 



(cf. (7.16)). We can then factor (7.21) as 



(de 



ce 



-i<p 



) {de"^ + ce-'-^) 



and the second quotient is i. That is, we are left to show that 

J I (c? — c)i cos 4> — (c? + c) sin A _^ q 
[b — a) cos 9 + i{b + a) sin 9 J 



Separating the real and imaginary parts of (7.22) we have 

(7.23) (a + b) cos 6* + (c - d) sin = 0, and 

(7.24) (-a + 6) sin 6* + (c + d) cos = 0. 

Solving these for sin (/), cos and plugging in to the above gives 



Im 



\d- 



CH 



c+d ' 



(d + c) 



a+b 
d—c 



{b — a) cos9 + i{b + a) sai9 



which is zero only if the real and imaginary parts of the numerator and denominator 
are in proportion: either sin 9 cos = or (clearing denominators) 

(d - cf{a - bf = (d + cf{a + bf . 

Neither of these is possible (recall that 9o = tt only when a = b and c = d), so the 
proof is complete. □ 
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8. The Edge-Inclusion Probabilities 



Let fj,n denote the measure on matchings of G„, where each matching has weight 
which is the product of the edge weights of its matched edges. 

The expected number of a-edges occurring in a /i„-random matching is simply 

a dZn 



E(iVa) = 



Zn da 



(this follows from the definition of /i„). From (7.13), the probability Pa{n) of a 
particular a-edge occurring in a /i„-randomly chosen matching is therefore 



(8.1) 



Pa{n) 



2n2 -Pi + P2 + P3 
From ( [7.12 ) we obtain (assuming Pi (a, b, c, d) ^ 0) 



(8.2) 



d_ 

da 



p.-p.Y. 



2(6 + ae 



-2lTij / 71 



^ (a + 5g27rij/ri'j2g-2irij/n _|_ (^g _j_ ^g2irifc/n ^2g-27rifc/n ' 



Note that this holds for all values of a, b, c, d for which Pi (a, 6, c, d) ^ 0, indepen- 
dently of whether or not a > b + c + d. Similar expressions hold for P2 , P3 , P4 . 

In what follows we may no longer assume that a is the largest of 6, c, d since we 
are computing a non-symmetric function pa- Recall that the quadruple (a, 6, c, d) 
determines two possible singularities ±(6*0, 4>o) of the function F of (7.17). 

We will define a set C N, depending on a, 6, c, d, on which our remaining 
convergence arguments work. If one of a,b,c,d is greater than the sum of the 
others, or if both a = b and c = d, take = N. If one of a, 6, c, d equals the sum 
of the others, we define W below in the proof of Proposition ^.2[ . In the remaining 
case, there are two distinct singularities ±{9o,(j)o), where € (0,7r). If 9o ^ tt/2, 
let W be the set of n for which 9o/tt is not well approximated by rationals of 
denominator n, in the following sense: for all integers j we have 

1 



n 



> 



,3/2- 



If ^0 = i'/2, define W as above using ipo instead: note that 9o and (po cannot both 
equal tt/2, for F{tt/2, 7r/2) = —i{a + bi)'^ — i{c + di)'^ cannot be zero (its real part 
is 2ab + 2cd). 

Lemma 8.1. When 9q G (0,7r), for any sufficiently large even n, one ofn,n + 2 is 
in W. 



Proof. Without loss of generality 9q ^. 

n 
and 



70 



Suppose 
1 

< 



?0 



TTJ 



n + 2 

Then j' must be equal to one of j, j 



< 



1 



(n + 2)3/2 ■ 

1 or j + 2; but then 



TTJ 

n 

a contradiction. 



i{2j, \2j 



n{n + 2) 



7i|, |2j — 2n|} ^ constant 



□ 
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Proposition 8.2. If none ofa,b,c,d equals the sum of the others, then for n tend- 
ing to oo in W, the edge-inclusion probability pa{n) converges to 

'■^'^ '-^'^ {b + ae-''^) d(j> d9 



^^■^^ 47r2 7o J„ (a + 6e»»)2e-*« + (c + de»'^)2e- 

// one of a,b, c, d equals the sum of the other three, then Pa{n) converges to the 
above integral along a subsequence W containing at least one of each pair n, n + 2 
with n even. 



Proof. We will show that the sum 
a d 



(8.4) 



2n'^ da 



log Pi 



a 
2^ 



E 

e,<t> 



2(6 



converges to the desired integral 



3), where the sum is over (6, (j)) — (27rj'/n, 27rfc/n). 

The value (8.1) is then a weighted av- 



Similar arguments hold for P2, P3, and P4. 
erage of these sums, with weights ±Pf/(2Z„). Since Zn > Pg > (see Proposition 
7.1), the weights are bounded in absolute value (less than 1/2) and sum to 1. 
Therefore the weighted average also converges to ( ^.3[ ). 

We separate the proof into four cases. In the first case, where one of a, 6, c, d 
is greater than the sum of the other three, there are no singularities {F{d, cj)) is 
never zero), so the summand is a continuous function on [0, 27r]2. Therefore (^.41) 
converges to the integral in 



.3) 



For the second case, suppose each of a, 6, c, d is strictly less than the sum of the 
others, but we are not in the case where both a = b and c = d. Since n G W, 
none of the four Pg can have a term with {9, (f>) within n^^^^ of a singularity. We 
claim that the sum (8.4) co nver ges to the integral ( |8.3D . This is proved in the 
same manner as in Theorem 7^: one needs only check that the contribution on 
a small neighborhood of the singularities is small. As before, let Us he a. 6 x S- 
neighborhood of a s ingu larit y. Ign ore for a moment the single term closest to the 
singularity. Lemma 7_A and ( 7.18 ) give us the estimate 



(8.5) 



a 
2^ 



E 

{0.<t>)eUs 



2(6 + ae-'^) 



Fie,, 



-ie\ 



2a s—^ \b + ae , 

-2^jj^\Ci{e-eo) + C2{(t>-cPo) + o{{e-eorAc^-(Por)y 

for constants Ci Fe{9Q, <j>o),C2 = F^{9o, (f>o) where C1/C2 ^ M. Now 
\Ci{0 - Oo) + C2{<P - M\ > CaK^ - 0o) + i{<jy - 0o)| > C4 max{|0 - 0o|, 10 - 
for some positive constants C3, C4. Using l/{x + 0{x'^)) — l/x + 0(1) for small x, 
where x = CsK^ — 0o) + *(0 — 0o)li we get the bound 



2a \ ^ 

2^^ 



Us 



|6 + ae-'''| 



|Ci(0-0o)+O2(0-</io)| 



0{l] 



Taking the 0(1) term out of the summation turns it into a 0{5^). 

Summing over annuli concentric about the singularity, we may bound the left- 
hand side of ( ^.5[ ) by 

constant y . « ^ ^^^^^ 

l<K<6n 
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The single term closest to the singularity contributes a negligible amount 

constant n /o 



In the third case, when both a = b and c = d, we have (6*0, ^io) — (tt, tt). Then 
Pi = and the other Pi are nonzero. Furthermore the pairs {9, (f) appearing in 
the products for P2,-P3,P4 do not come within distance 7r/n of the singularity. 
Since near (a,6, c, d), Pi is a polynomial taking non-negative values which is zero 
at {a,b,c,d), it must have a double root there. Thus its derivative with respect to 



a is zero at {a,b,c,d). We can therefore remove Pi from the expression (S.l) and 



just deal with the remaining three Pg. When a = b and c = d, (pM becomes 



2a 
2^ 



^ a2(2 + 2cos6') + c2(2 + 2cos( 

Us 



2a^ 
- 2^ 



\t{e-n)\ 



^ |a2(6l-7r)2 +c2(</)-7r)2| 

. Us 
3\ 



+ 0{S 

where we used l/{x^ + 0{x'^)) = + 0{l). This is 0{S) (sum over annuh as 
before) . 

Finally we consider the case when one of a, 6, c, d is equal to the sum of the 
other three. Suppose first that a = b + c + d. Note that pa{n){a,b,c,d) is a 
monotonic increasing function of a: the expected number of a-edges increases with 
their relative weight. For each S > sufficiently small, choose n so that 

\Pa{n){a - S, b, c, d) - Pa{a - S, b, c,d)\ < S. 

Such an n exists because (a — S, b, c, d) is in the domain of case two, above. By 
monotonicity 

Pa{n){a, b, c, d) > Pa{n){a - (5, b, c, d) > Pa{a - S, b, c,d) - S 

for this n. Take a sequence of 5's tending to 0. On the corresponding sequence of 
n's, Pa{n){a, b , c, d ) tends to 1, which is equal to the value of the integral Pa(a, b, c, d) 
(see Theorem |8 . 3| below) . The set W in this case is obtained from the concatenation 
of the appropriate subintervals of the sets W that are defined for each quadruple 
(a — 5, 6, c, d). 

Since pa{n){a, 5, c, c?) — > 1 we must have that 

Pb{n){a,b,c,d) 0, 
Pc{n){a,b,c,d) 0, 
Pd{n){a,b,c,d) 0. 

This (and symmetry) takes care of the remaining cases. □ 
A rather lengthy calculation yields the following result: 

Theorem 8.3. If a > b + c + d, then pa — 1- If one of b, c, d is larger than the 
sum of the other three of {a, 6, c, d}, then pa = 0. Otherwise, let Q be a cyclic 
quadrilateral with edge lengths a,c,b,d in cyclic order. Then p a is l/(27r) times the 
angle of the arc cut off by the edge a of Q. That is, 

(8.6) 

1 . _i / a J (a + b + c - d){a + b - c + d){a - b + c + d){-a + b + c + d)\ 
Pa = -sm ' 



TT 



2y^{ab + cd){ac + bd) {ad + be) 
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(Here the branch of the arcsine that is needed is the one given by the preceding 
geometrical condition.) 

The proof is in Section |l^. Note that by this theorem, in the case of non-extremal 
tilt, the 4-tuple 

(sin(7rpa), sin(7rp{,), sin(7rpc), sin(7rprf)) 
is proportional to (a,6, c, d). Since a constant of proportionality has no effect on 
the measures /z„, we can assume that a = sin(7rpa), etc. 

We also note a simple relation between the singularity (0o,0o) and the edge- 
inclusion probabilities (hereafter simply calle d edge probabilities), which will be 
useful later. From Figure ^ and Theorem B.3, when a > b and c > d we find 

(8.7) 6*0 = vr- 7r(pc -Pd), and 

(8.8) (f)0 = TT - 7r{pa ~ Pb)- 

We now bound the variance of Na, the number of a-edges in a matching. 

Proposition 8.4. For all a, b, c, d and for n g W, a'^{Na) = o(n^). 

Proof. The graph G„ has 2n'^ a-edges. For k € [1, 2n^] let qk be the {0, l}-valued 
random variable indicating the presence of the fcth a-edge in a random matching. 
Then Na = qi + ■ ■ ■ + q2n^ , and so 

(8.9) a^{Na) = ^a2(gfc) +^(E(gfeq,) - E(qfe)E(<?,)). 

k k^e 

We have E(gj,) — Pa{n) and so cr^(gfc) — Pa{n) —Pa{n)^. In the case when one of 
a, 5, c, d is greater than the sum of the others, we know from Proposition 3.2 that 
Pa converges to 1 or 0; as a consequence a'^{qk) — > 0, and the covariances converge 
to also, so a'^{Na) — o(n^) as well. Similarly when one of a, 6, c, d equals the sum 
of the others; then pa converges to 1 or along VF, and so a'^{Na) is o(n^) on this 
same subsequence. 

The remaining cases require more work. By (a straightforward extension of) 
Theorem 6 of [Kel], we have 

(8.10) 

^^"'"'^ - .p,+p, + p, + p, ' 

where 

' ^ ^ V A-'ive^Wk) A '-{ve,Wi) J' 

and where Vk , Wk are the vertices of the edge associated with qk {vk being the left 
vertex) and vg, wt the vertices of the edge associated with qt^ {vi being the right 
vertex). The inverses A~^ are only defined when the corresponding Pi are nonzero. 

When n £W tends to oo the diagonal entries A~^{vk,W k), A ~^{vi,we) tend to 
Pa (see Proposition 8.2). Writing each 2x2 determinant in ( |8.1C| ) as the product of 
the two diagonal entries minus the product of the off-diagonal entries, the diagonal 
entries can be taken out of the quotient in (3.10) and contribute p^ + o(l). It 
remains to estimate the contribution of the off-diagonal entries. 

We can compute the inverses of the Ai as follows. Note that from (7.4) we have 



Di 



I'''- \ Do 
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where 



1 



b + az ^ —i{d + cw ^) 
-i{c + dw'') a + bz^ 



(a + bzi)^z + (c + dw'^)w 

(we won't need the expression for 02). Recall the definition of the matrix 5 of 
([7^). Let Sx^y^s be the vector 



We have 



1 if {j,k,t) = {x,y,s), and 
otherwise. 

l^~2^tjx/n^~2niky/n jf ^ ^ 5 and 







otherwise. 



From (7.3) we therefore find, for example, 
(8.11) 



Ari((0,0,l),(x,2/,3)) 
and 



■EE 



-27r i{jx-\-ky) /n 



{b + ae 



-27rij /n 



_|_ /n"j2Q—27Tij /n _|_ _|_ ^^2-Kik/n'^2^~27rik/n 



A-H{Q,0,l),{x,yA))^-J2Yl 



^ / ^ _j_ Jj^2'Kij/n^2^~2TTij/n _j_ _j_ ^^27rik/n^2^ — 2'jTik/n ' 

We also have Aj^^((0, 0, 1), (x, y, <)) = when t = 1 or t = 2. Similar expressions 
hold for inverses of ^2,^3,^14. 

An argument identical to the proof of Proposition 3.2 (the only difference is the 
factor which has modulus 1) shows that the parts of the sums ( ^.11 ) 

over a ^-neighborhood Us of the singularities are 0{S). 

We will show that for all y with (1 — e)n > y > en (later we will set e = n^^/'*) 
the value A^^{{0, 0, 1), {x, y, 3)) tends to zero as n — > 00 in Similar results hold 
for A2, A3, and A4. For si mplici ty of notation let 0,v denote the vertices (0,0, 1) 
and (a;,y,3). The equation ( ^.ll| ) has the form 

A-\0,v) - Ve-2-(^--+'=^)/"Gi(j7n,A:/n), 

where Gi is a smooth function on the complement of the region Us- We already 
know that the sum over Us is 0(5), so let us replace Gi by a new function G2 which 
agrees with Gi outside Us and is zero on Us- We sum by parts over the variable k 
to get 



A^\Q,v) 



1 



EE 

j=0 k=Q 



E< 



-27ri£y /n 



G2 



1 



n— 1 / n—1 

EE' 



n ' n 



'27riiy /n 



G2 



G2 



J k+1 



j=o \e=o J 
Since (1 — e)n > y > sn, the sum over £ of the exponentials is 



0(5). 



1 _ „-27ri(fe+l)y/n 
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for each k. The difference \G2{j /n, k/n) — G2{j/n, (k + is bounded by 1/n 

times the supremum of \dGi/dy\ on the complement of Us^ except that at the points 
adjacent to the boundary of Us, the difference is bounded by the supremum of |Gi| 
near the boundary. 

One can check (see the proof of Proposition |8.2| ) that the sup of \dGi/dy\ on the 
complement of Us is 0(5~'^), and the supremum of \Gi\ on the boundary of Us is 
0{5~^). Only 0{5n) pairs {j,k) correspond to points adjacent to the boundary of 
Us, so we have 

\A-,\Q,v)\ < ±lj2Y.O{e-')0{S-^)-]+^0{n6)0{S-')0{e~') 



3 

Choosing S — e = rC^I^, we have A^^(0, w) = 0(n~^/^). 

A similar argument holds in the case where both a — b and c — d. From ( ^.9|) 
we have 

4 

a^{Na)<n^O{l) + n^o{l)+ ^ J2\^^^('"k,w,)A~\v,,Wk)\, 

edges k^i i—1 

but A~^{vk,we) and A~^{vi,Wk) are 0(ri~^/^) as soon as the edges k and £ are 
separated by at least en = v?^'^ in their y-coordinates. Therefore (using e — n"^/**) 

a'^{Na) = 0{n^) + o{n^) + 0{n^ ■ en^) + 0{n^n-^^^) ^ o(n4). 

Here the term 0{n^ ■ en^) comes from edges whose y coordinate is less than en or 
greater than (1 — e)n, and the term 0{n^n~^^'^) consists of the remaining pairs of 
edges. As a consequence cr^(iVa) — o{n^). □ 



One can show from this argument (using the result of |Kcl ) that for n <eW the 
measures /i„ converge weakly to a measure on the set of matchings on Z^. The 
result is as follows. 

Define 



(a + 6e^'')2e-*« + (c + de'^'t'Ye-^'t' 



and 

y^'-J^) 4^2 (a + 6e'»)2e-»» + (c + de'^)2e-^^' 

Also, define a colored configuration of dominos as a configuration of dominos with 
a checkerboard coloring of the underlying square grid. 

Proposition 8.5. As n — > oo within W, the probability of finding a certain col- 
ored configuration of dominos in an (a, b, c, d) -weighted n x n torus converges to 
|w;detM|, where w is the product of the weights of those dominos and Mi,j — 
P{vij), with Vij e the displacement from the i-th white square to the j-th black 
one. 
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Ben Wieland has pointed out that when ab — cd, one can write this more sim- 
ply. Define P'{2x + l,2y) = c{a/bY{c/ d)y P{2x + l,2y) and P'{2x,2y + 1) = 
c{a/b)^{c/dyP{2x, 2y + 1). One can check that if ab = cd, then the probabihty we 
seek is simply the determinant of the matrix with entries P'{vij), with no need to 
multiply by the products of the weights of the included dominos. This formulation 
makes the conditional uniformity immediately apparent. 



9. The Entropy 

9.1. Entropy as a function of edge-inclusion probabilities. Since the set of 
matchings on G„ is finite, the entropy of a measure /i on the set of matchings is 
simply 

M 

where the sum is over all matchings M and fJ.{M) is the probability of M occurring 
for the measure /i. The entropy per dimer is by definition 

ent(M) ^ 

(recall that a matching of Gn has 2n^ matched edges). 



Recall that for real z, L{z) is the Lobachevsky function, defined by (1.3). 
Proposition 9.1. As n — > oo in W , ent(/x„) converges to 

(9.1) ent(a, 6, c, d) = - {L{Trpa) + L{TTpb) + L{ttPc) + L{TTpd)) , 

TT 



where Pa, Pb,Pc,Pd are given by (|^ 
Proof. Let C{Na, Ni,,Nc, Nd) denote the coefiicient of 

in Zn- 

As we computed earlier, on the toroidal graph G„ the /i„-probability of an a-edge 
(resp. b-, C-, d-edge) is given by Pain) (resp. Pb{n), Pc{n), pd{n)). The expected 

number of a-edges is Na =^E(A^a) = 2v?pa{n). 
Let 

U, = {{Na, Nb, N„ Nd) : \Na - Na\ < sNa, \Nb - Nb\ < sNb, 
\Nc - Nc\ < eNc, \Nd - Nd\ < eNd}. 

Let Vf; be the corresponding set of matchings, i.e., those where the corresponding 
quadruples {Na, Nb, Nc, Nd) are in 11^. Because the variance is o(n'*), for all e, e' > 
there exists hq such that for n € W greater than hq we have 



J2C{Na,Nb,N„Nd)a^-b^'c^^d^^ > {1 - e')Z„{a,b,c,d). 
Note that if pj are probabilities and pi + ■ ■ ■ + pk < s' < 1, then 
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(since the left-hand side is maximized when the pj 's are equal) . Thus for the entropy 
we may write 

Hi^i) = - J2 M(M)logA*(Af)- ^l{M)\og^,{M) 

= - ^ ^,{M)\og^,{M)- MM) log I 

= O fe' log 



constant'^ 



Zn 



but for Af G 14, we have log(a 



) < £log(a "), and similarly for b, c, d, so 



log{a^'b^'c^-d^^){l + 0(e)) + log Z„ +Oin^e' loge') 



Note also that X^MeVe l^i-^) — 1 ^ Letting e, e' ^ as n ^ 00 we have 
finally that the limiting entropy per dimer is 

ent(a, b, c, d) = lim ^ flog Z„(a, b, c, d) - log(a^"6^''c^'^d^'' 

= log Z{a, b, c, d) - Pa log(a) - pb \og{b) - log(c) - pd log(d) 

Without loss of generality we may assume that a > b and c > d; then from (| 
we have 9o — tt — tt(j>c — Pd)- Plugging in from ( 7.2C ) now gives 

(9.2) 

1 

ent(a, 6, c, d) = — / \og\(3iz)\ dd - 



l-Pc~Pd 



log(cd) - Pa log(a) - pb log(5). 



To prove the equivalence of this formula and (9.1 ), we show that they agree when 
a — b ^ c = d, and show that their partial derivatives are equal for all a, b, c, d. 
Formula (|9.2[) gives 



ent(l, 1,1,1) = J log (2 + cos 61 + ^/{2 + cos Of - l) d0 



1 

in 



2 log cos 



1 de. 



This is two times the value of the entropy per site given in formula (17) of |Kal|, 
as it should be since the entropy ent(l, 1,1,1) as we defined it is the entropy per 
dimer. Kasteleyn also shows that this value equals 2G/tt, where G is Catalan's 
constant 

1 1 



G = 1 



From the expansion 



1 

32 ^ 52 " 72 ^ 
sin(2fca;) 



2 ^ fc2 

fe=l 



(see Q) we have 2G/7: = (4/7r)L(7r/4), so the two formulas agree when a = b = 
c — d. 
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It remains to compute the derivatives. Equation (9.1) is symmetric under the 
full symmetry group 5*4, and ( |9.2| ) is by definition symmetric under the operations 
of exchanging under the operations of exchanging a and b, exchanging c and d, and 
exchanging a, b with c, d. These operations are transitive on {a, b, c, d}, so it suffices 
to show equality of the derivatives with respect to a. We have 

d d 

—ent{a,b,c,d) = — (log Z - log(a) - log(6) - log(c) - pd log(d)) 

(recall that Pa — {a/Z)dZ/da). 

On the other hand when x is one of a, 6, c, d we have 

— -L{ttPx) = - log(2sin(7rp2;))— ^. 
oa TT oa 



Taking d/da of (9.1) and recalling that a — sin(7rpci), etc., gives 



Since 



log(2a)— - log(26)^ - - log(2d) — . 



d 

l0g(2) — (pa + P6 + Pc + Pd) = 0, 



the proof is complete. □ 



As explained in the proof of Theorem 4.1, the entropy converges for all n, not 
just for n £ W: 

Theorem 9.2. As n — > cx) the entropy per edge of matchings on G„ of the measure 
^n{a,b,c,d) converges to 

ent(a, 6, c, d) = - {L{npa) + L{Tipb) + i(7rpc) + L{iTpd)) , 

TT 



where Pa, Pb,Pc,Pd are given by (8.t) 



Intriguingly, this formula can be used to show that when each of a, 6, c, d is less 
than the sum of the others (the only case in which the entropy ent(s, t) is non-zero), 
ent(s,t) is equal to I/tt times the volume of a three-dimensional ideal hyperbolic 
pyramid, whose vertices in the upper-half-space model are the vertex at infinity 
and the four vertices of the cyclic quadrilateral of Euclidean edge lengths a, c, b, d 
in cyclic order (otherwise the entropy is 0). We have no conceptual explanation for 
this coincidence. 

We do not even fully understand why the limiting behavior of the measures /i„, 
viewed as a function of a, 6, c, d, turns out to be symmetrical in its four arguments. 
This symmetry is not merely combinatorial, since it emerges only in the limit as the 
size of the torus goes to infinity. R. Baxter suggests (in personal communication) 
that it is almost certainly the same symmetry that occurs in the checkerboard 
Ising model. In that setting it can be proved using the Yang-Baxter relation (see 
|jMi Q). 



A VARIATIONAL PRINCIPLE FOR DOMINO TILINGS 



43 



9.2. Entropy as a function of tilt. Given a tilt {s,t) satisfying \s\ + \t\ < 2, we 
claim that there is a unique (up to scale) 4-tuple of weights a, 6, c, d satisfying the 
conditional uniformity property ab = cd and such that the average tilt of fj.a,b,c,d 
is (s,i). To d etermine a,b,c,d, we note that pa,Pb,Pc,Pd are determined by the 
equations ( |l.4| )-(p^). To solve these equations, note that (1.7) can be written 

cos(7r(pa - pb)) - cos(7r(pa + pb)) = cos(7r(pc - Pd)) - cos(7r(pc + Pd)), 

and so 

cos(7rt/2) - cos(7r(pa + Pb)) = cos(7rs/2) - cos(7r - 7r(pa + Pb)), 

giving 

2cos(7r(pa +Pb)) — cos(7ri/2) — cos(7rs/2). 

This combined with ( |l.4| ) gives (^T^), where the values of cos~^ are taken from [0, tt] 
(to see why, notice that cos~-^((cos(7ri/2) — cos(7rs/2))/2) — n{pa + Pb), which is 
between and tt). 

Finally we can determine a, b, c, d as functions of the tilt by 

a = sininpa), 5 = sin(7rpb), c = sin(7rpc), d = sin{Trpd). 

10. Concavity of the Entropy 

Theorem 10.1. The entropy per edge ent(s, t) is a strictly concave function of s, t 
over the range \s\ + \t\ < 2. 

Proof. We show that the Hessian (matrix of second derivatives) is negative definite, 
that is, entss{s,t) < 0, enttt(s,i) < 0, and ent s s {s, t) ent tt {s, t) — entst{s, t)"^ > 0, for 
all s,t, except at the four points {s,t) = (±2,0) or (0,±2). 
A computation using Theorem and equations gives 

dent dpa <9ent dpd 



entt(s, t) 



A second differentiation yields 
9^ent(s, t) 



dpa dt dpd dt 

1 . / sin(7rpa) \ 



4 ^[^sm{TTpb)J 



dt"^ 32 sin(7r(pa + pb)) sin(7rpa) sin(7rpb) 



sm' 



2 ^ (cos(f )+C0s(f ))^ 



and this quantity is strictly negative except at the points {s,t) ~ (±2,0), (0,±2). 
A similar calculation holds for entss(s,t): 

9^ent(s,i) — tt 



ds"^ 32 sin(7r(pa + Pb)) sin(7rpQ) sin(7rpb) 

2 /ttA , (cos(f )+cos(f))2 



sm 



We have 



entst(s,i) 



2 J 2 
sin(7rs/2) sin(7rt/2) 



32 sin(7rpa) sin(7rph) sin(7r(pa + pb)) ' 
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Finally, 



entssenttt - ent^^ 



32 sin(7r(pa + pb)) sin(7rpa) sin(7rpb) 



sm 



TTt 



(cos(^) +cos(f )) 



(cos(^) +cos(f ))^ 



• 2 ■ 

3m (-jsir 



which is clearly positive. 



□ 



11. Proof of Theorem 8.3 



In what follows, we must be careful to distinguish the differential dw from the 
product d ■ w (we will write the product as wd to avoid confusion). Let z = e'^, 
w = e'"^ and r{z) = cd+ {a + bzY/{2z) as before. Then (see ( |7.14| )) 

o- f f w{b + a/z) dw dz 
Att'^ J gi J gi + 2rw + w-^d"^ iw iz 



Pa = 



4^ 



(6 + a/ z)dz 



dw 



Igi {wd — ac){wd — (3c) 

Recall (see the second-to-last paragraph of Subsection ^j] ) that when \z\ = 1, 
|/3(z)c| > d always, and |q;(z)c| < c? if and only if e {-~6q,6q) (remember that 
we defined 6q — n the case a > 6 + c + d). If |a(z)c| < d, then the residue of 
{{wd — ac){wd — (3c))~^ is 



{a — (3)cd 



Thus we have 



Pa 



-a 2m r"-'^" b + a/z 
47r2 cd Jg^_e^ z{a - P) 



dz 



{bz + a) dz 



and recalling the definition of r and simplifying yields 



Pa 



dz 



27r Je=-eo zyj{a + bz^ + AzcA 

We don't have to worry about keeping track of the sign of the square root since we 
know that we want Pa > 0. In fact we only need to be careful about the sign when 
we get to ( 11.1 ), below. 

This integral can be explicitly evaluated, giving 



log 



+ {ab + 2cd)z + a^J{a + bz^ + Azcd 



This expression can be simplified using the variable (or rather, one of the two 
variables) w = w{z) such that (a -I- 6z)^/z + (c + wd)"^ /w = 0: the expression under 
the square root is then 

{c + wd)^ 



(a -I- bz) + Azcd - 



Acd 



'-{c^iDd)' 
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Plugging this in yields 



Pa = TT- 



2^ 



2^ 



log \- ab + 2cd + a(c — wd) 



log 2cd + ^ <^ ^ + by/z + i —= + d^fw \ - 2ida^ 



(11.1) 

Up until ( |1 1 . 1| ) , changing the sign of the square root will only change the sign of 
the integral. In (|l 1 . 1 ) , we choose the sign of y/w/z (or, wha t is th e same, the sign 
of y/wz) so that the expression in curly brackets is zero (cf. (|7.16|) ). We then have 



log -2d 



Iw 
ia\ I — 



Had we chosen the other sign we would have gotten a similar expression with c and 
d interchanged. 

But now Figure ^ (whose lower quadrilateral is a rotation of Figure ^, and whose 
upper quadrilateral is the reflection of the lower across the edge c) shows that, as 
runs from —Oq to 6*0, the quantity — c + iayjwj z sweeps out an angle of Qa-, the 
angle of arc cut off by edge a in a cyclic quadrilateral of edge lengths c, a, d, h. Thus 
Va = ^a/ (2vr). In the case a > 5 + c + d, one can similarly show that — c + ia^ w/ z 




Figure 7. The integral defining pa- Note that the angle between 
the two dotted rays at the origin is 9 a- (Recall that 9a — 2Tipa is 
the angle of arc cut off by the edge a.) 



sweeps out an angle of 27r, and t hus Pa — 1. 

Finally, to prove the formula (8.6), recall the well-known formula for the radius 
r of the circumcircle of a triangle of sides a, 6, c: 

(11 2) = 

{a + b + c){a + b-c){a-b + c}{-a + b + c) 
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From this one can compute, for a cyclic quadrilateral of sides a, c, b, d, the length s 
of the diagonal having the b and c edges on the same side: 



[ab + cd) [ac + bd) 
ad + be 



Plugging this value back into the formula ( 11.2 ) with a,b,c replaced with a,d,s 
gives 

2 {ab + cd) {ac + bd) {ad + be) 

^ ^ {a + b + c- d){a + 6 - c + d){a - 6 + c + d)(-a + 6 + c + rf) ' 
from which ( ^.6| ) follows by Pa — 7r^^sin^^(a/(2r)). 

12. The PDE 

Under the assumption that the entropy-maximizing function / is , the Euler- 
Lagrange equation for / is 

^(ent, (/,,/,)) + A(entt(/,,/^)) = 0, 

where ents , entt are the partial derivatives with respect to the first and second 
variable, respectively. This equation holds only at points where the tilt {fx,fy) is 
non-extremal, i.e., satisfies \fx\ + \fy\ < 2; otherwise, perturbing / will mess up the 
Lipschitz condition. In case / is only C^, this equation still holds in a distributional 
sense: it is true when integrated against any smooth test function g vanishing on 
the boundary (and such that / -I- is 2-Li pschitz for sufficiently small e > 0). In 



such a case / is called a weak solution [ GT [ 



We computed in the proof of Theorem 10.1 that 



1 . / sin(7rprf) \ 



ents{s,t) -7 log V ' 

4 \sm{TTpc) J 



1 , / s in(7rpa) \ 



(12.1) entt{s,t) = - log 

4 \sm{7Tpt,) J 



Plugging in from (L8) and simplifying yields the following PDE: 

Theorem 12.1. At the points where the entropy-maximizing function f is and 
has non-extremal tilt, it satisfies the PDE 

(l-i?^)-sin2(^))/.. + 2sin(^)sin(^)/., 

^2 (1 - - sin^ 
where D = ^ (cos {'Kfx/2) — cos {■Kfy/2)). 

13. Conjectures and Open Problems 

In this article, Pa^ Pb, Pc, and pd were defined in relation to the dimer model 
on an n X n torus, in the thermodynamic limit as n — > 00. We have proved 
no corresponding interpretation of these quantities for the thermodynamic limit 
of planar regions. However, our results on asymptotic height function associated 
with large regions imply that, in a patch of a large region where the associated 
asymptotic height function / satisfies {df /dx,df /dy) = {s,t), the local density of 
a-edges minus the local density of 6-edges equals Pa ^ Pb, and likewise for the c- 
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and d-edges. To be precise here, one would define local densities as averages over 
mesoscopic patches within the tiling (which we recall are defined as patches whose 
absolute size goes to infinity but whose relative size goes to zero). 

Conjecture 13.1. The local densities oj a- edges, b-edges, c-edges, and d-edges are 
given by pa, Pb, Pc, and pd, respectively, in the thermodynamic limit. 

Here our use of the phrase "thermodynamic limit" carries along with it the sup- 
position that we are dealing with an infinite sequence of ever-larger planar regions 
whose normalized boundary height functions converge to some particular bound- 
ary asymptotic height function, and that the subregions we are studying stay away 
from the boundary by at least some mesoscopic distance. 

The local density of a-edges is just the average of the inclusion probabilities of all 
the a-edges within a mesoscopic patch. The authors have empirically observed that 
such averages do not arise from the smoothing out of genuine fluctuations. Rather, 
it seems that apart from degenerate cases, all the a-edges within a patch have 
roughly the same inclusion probability. These degenerate cases occur in patches 
where the asymptotic height function is not smooth (so that (s, t) = {df /dx, df /dy) 
is undefined), or where the tilt is nearly extremal (i.e., \s\ -f \t\ is close to 2). Hence 
we believe: 

Conjecture 13.2. In the thermodynamic limit, the probability of seeing a domino 
in a particular location is given by the suitable member of the ^-tuple {jPa,Ph^Pc^Pd) : 
wherever the tilt {s,t) (given by the partial derivatives of the entropy-maximizing 
height function) is defined and satisfies \s\ + \t\ < 2. 

This is the conjectural interpretation of Pa,Pb,Pc,Pd that was alluded to in Sub- 
section 1.1. We are two removes from being able to prove it in the sense that we 
do not even know how to prove Conjecture 13.1. 

The restriction |s| -|- \t\ < 2 deserves some comment. It is not hard to devise a 
large region composed of long diagonal "herringbones" that has only one tiling (see 



the final section of |CEP]). For such regions, the edge-inclusion probabilities can 
be made to fluctuate erratically between and 1. Such regions have asymptotic 
height functions in which the tilt is extremal everywhere it is defined, so we can 
rule out such behavior on the basis of tilt. 

Incidentally, a conjecture analogous to Conjecture 13. 2| can be made for the 



case of lozenge tilings (which can be studied using the methods of this pape r. 



by setting one edge weight equal to 0). Here the analogue of Conjecture |l3.l| is 
actually a theorem; that is, for lozenges there are only three kinds of orientations of 
tiles, so that their relative frequencies, which jointly have two degrees of freedom, 
both determine and are determined by the local tilt (s, t) of the asymptotic height 
function. 

Straying further into the unknown, we might inquire about the probabilities of 
finite (colored) configurations of tiles. Here again we are guided by the Ansatz that 
what is true for large tori should be true for large finite regions as well. Given 
any tilt (s,t) satisfying \s\ + \t\ < 2, choose weights a,b,c,d satisfying ab = cd 
and giving tilt (s, t) in accordance with our earlier formulas, and use the formula 
in Proposition |8.5| to define a measure ^s,t on the space of domino tilings of the 
plane. This measure is invariant under color-preserving translations and satisfies 
the property of "conditional uniformity" — given any finite region, the conditional 
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distribution upon fixing a tiling of tfie rest of tlie plane is uniform. Proposition B.5 
invites us to surmise: 

Conjecture 13.3. Let \s\ + |t| < 2, and let a,b,c,d be weights satisfying ah — cd 
such that the average height function for weighted torus tilings has tilt (s,f). Then 
for any colored configuration of dominos, the probability of finding it in a specified 
location in a random n x n torus tiling converges as n oo to the value given by 
the measure ^s,t- 



Proposition approaches this claim, but it restricts n to lie in a large subset 
W of the integers. 

The measures fig^t have positive entropy whenever |s| + \t\ < 2. Furthermore, 



the coupling function calculations in the proof of Proposition 3.4 show that these 
measures are mixing (correlations between distant cylinder sets tend to 0) and 
hence ergodic. We believe that these measures can be characterized uniquely by 
the properties mentioned so far, although we cannot prove it. 

Conjecture 13.4. Every ergodic, conditionally uniform measure on the set of 
tilings of the plane that is invariant under color-preserving translations and has 
positive entropy is of the form fi^^t for some (s, t) satisfying \s\ + \t\ < 2. 



Assuming the truth of Conjecture 13.3, it would be natural to advance a further 



claim that would come close to being the final, definitive answer to the question 
Kasteleyn raised nearly four decades ago: 

Conjecture 13.5. In the thermodynamic limit for a sequence of finite regions con- 
verging to a fixed shape, the probability of seeing any colored configuration of domi- 
nos is given by the measure fis,t, wherever the tilt (s,t) given by the variational 
principle is defined and satisfies \s\ + \t\ < 2. 

Finally, we discuss the seemingly miraculous geometric interpretation of our 
formula for the entropy. As was pointed out in Section when each of a, b, c, d 
is less than the sums of the others, the (asymptotic) entropy of torus tilings with 
weights a,b,c,d equals I/tt times the volume of a three-dimensional ideal hyperbolic 
pyramid, whose vertices in the upper-half-space model are the vertex at infinity and 
the four vertices of the cyclic quadrilateral of Euclidean edge lengths a, c, 5, d (in 
cyclic order). Notice that we needn't assume any particular scaling for the weights, 
because homotheties (x, y, z) i— > (rx, ry, rz) are isometries of hyperbolic space. 

Open Problem 13.1. What do tilings have to do with hyperbolic geometry? 

Explaining that connection is one of the most intriguing open problems in this area. 
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